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FIBER SHAFT MODIFICATIONS FOR EFFICIENT TARGETING 
RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. provisional application Serial No. 
60/350,388, filed 24 January 2002, entitled "FIBER SHAFT MODIFICATIONS 
5 FOR EFFICIENT TARGETING," to Stevenson, Susan C., Kaleko, Michael, Smith, 
Theodore and Nemerow, Glen R., and to U.S. provisional application Serial No. 
60/391,967, filed 26 June 2002, entitled "FIBER SHAFT MODIFICATIONS FOR 
EFFICIENT TARGETING," to Stevenson, Susan C., Kaleko, Michael, Smith, 
Theodore and Nemerow, Glen R. This application is also related to International 
10 PCT application No. (attorney docket number 22908-1236), filed the same day 
herewith, entitled "FIBER SHAFT MODIFICATIONS FOR EFFICIENT 
TARGETING," to Stevenson, Susan C., Kaleko, Michael, Smith, Theodore and 
Nemerow, Glen R. Where permitted, the subject matter of each of these 
applications is incorporated by reference herein. 

15 FIELD OF INVENTION 

The present invention generally relates to the field of adenoviral vectors 
and the production of such vectors. In particular, detargeted adenoviral vectors 
are provided. 

BACKGROUND 

20 Most, if not all, adenoviral vector-mediated gene therapy strategies aim to 

transduce a specific tissue, such as a tumor or an organ. Systemic delivery will 
require ablation of the normal virus tropism as well as addition of new 
specificities. Multiple interactions between adenoviral particles and the host cell 
are required to promote efficient cell entry (Nemerow (2000) Virology 274:1-4). 
25 An adenovirus entry pathway is believed to involve two separate cell surface 
events. First, a high affinity interaction between the adenoviral fiber knob and 
coxsackie-adenovirus receptor (CAR) mediates the attachment of the adenovirus 
particle to the cell surface. A subsequent association of penton with the cell 
surface integrins a v 0 3 and ajff 5 , which act as co-receptors, potentiates virus 
30 internalization. There are a plurality of adenoviral fiber receptors, which interact 
with the group B (e.g., Ad3) and group C (e.g., Ad5) adenoviruses. Both of 
these groups of adenoviruses appear to require interaction with integrins for 
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need to develop adenoviruses which are fully detargeted in vivo for use as a 
base vector for producing redirected adenoviruses. 

Therefore, among the objects herein, it is an object herein to provide fully 
detargeted adenoviral vectors, methods for preparation thereof, and uses 
5 thereof. 

SUMMARY 

Detargeted and fully detargeted adenoviral particles, adenovirus vectors 
from which such particles are produced, methods for preparation of the vectors 
and particles and uses of the vectors and particles are provided. Provided and 
10 described are capsid modifications, such as fiber shaft modifications, and the 
resuling proteins that, when expressed on adenoviral particles provide for 
detargeting of adenoviral vectors. The capsid modifications, such as the fiber 
shaft modifications, can be combined with other modifications, such as fiber 
knob and/or penton modifications, to produce fully ablated (detargeted) 

15 adenoviral particles. Thus, adenoviral vectors and adenovirual particles whose 
native tropisms are ablated through a modification or modifications of capsid 
proteins, particularly a fiber shaft region, are provided. 

Thus, provided are capsid mutiations, including fiber shaft modifications, 
that ablate binding to particular receptors, thereby permitting efficient targeting 
20 of adenoviral vectors that contain capsids with such modifications. For example, 
adenoviral vectors in which the fiber shaft's interaction with HSP is ablated 
(reduced or substantially eliminated), particuarly in vivo, are provided. These 
fiber shaft modifications can be combined with other modifications, such as fiber 
knob and/or penton modifications, to produce fully ablated (detargeted) 

25 adenoviral vectors. Also provided are retargeted vectors and particles that 

include a ligand or ligands to provide for targeting of the detargeted vectors and 
particles to selected cells and/or tissues. Retargeting can be effected, for 
example, by manipulating the fiber protein to redirect the receptor specificity to a 

particular cell type. 

30 Also provided are nucleic acids encoding the modified fiber proteins and 

also modified penton proteins. Also provided are nucleic acids encoding the 
modified fiber shaft protein that has ablated HSP binding and combinations 
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mutations of individual amino acids in the fiber shaft that interact with HSP or 
mutations of amino acids in the fiber shaft that modify the ability of the HSP 
binding motif to interact with HSP. Adenoviral fiber shaft modifications also 
include replacements of fiber shafts using fiber shafts of adenoviruses, such as, 

5 for example, Ad3, Ad35 and Ad41 short fiber shaft, that do not contain HSP 
binding sites. 

Also provided are adenoviral fiber shaft modifications that alter, 
particularly ablate viral interaction with HSP, as described above, in combination 
with fiber knob modifications that ablate viral interaction with CAR. The fiber 
10 knob modifications include: (a) mutations of individual amino acids in the fiber 
loop that interact with CAR, such as, for example, AB or CD loop modifications; 
(b) mutations of individual amino acids in the fiber loop that modify the ability of 
the CAR binding motif to interact with CAR; and (c) replacements of fiber knobs 
using adenoviruses that do not interact with CAR, such as, for example, Ad3 
15 fiber knob, Ad41 short fiber knob, or Ad35 fiber knob. 

Also provided are adenoviral fiber shaft modifications as described above 
in combination with penton modifications that ablate viral interaction with a v 
integrins. The penton modifications include: (a) mutations of individual amino 
acids that interact with a v integrins; (b) mutations of individual amino acids that 
20 modify the ability of the a v integrin binding motif to interact with the cr v integrins, 
and (c) replacement of penton proteins using penton proteins from adenoviruses 

that do not interact with the a v integrins. 

Also provided are adenoviral fiber shaft modifications as described above 
in combination with fiber knob modifications as described above and penton . 

25 modifications as described above. 

Also provided is a scale-up method for the propagation of detargeted 
adenoviral vectors. The method uses polycations and/or bifunctional reagents, 
which when added to tissue culture medium results in entry of adenoviral 
particles into the producer cells. 
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Ad5 or Ad 2 fiber, to produce a complete fiber whose binding to HSP is reduced 
or eliminated. 

All of the modified capsids proteins provided herein also can include one 
or more further modifications that reduce or eliminate interaction of the resulting 
fiber with one or more cell surface proteins, such as but not limited to, CAR and 
cr u integrin or other receptor to which a particular native fiber binds, in addition to 
HSP. These modifications include, but are not limited to, modification to fiber 
that reduces or eliminates CAR binding and modification to penton that reduces 
or eliminate a v integrin binding. The mutations can be in the fiber knob, shaft, 
tail and shaft, and also in penton. 

Any and all of the modified capsid proteins provided herein can further 
include a .ligand that binds to a particular receptor thereby endowing a fiber (or 
other capsid protein) with binding specificity or the ability to interact with such 
receptor. The ligand can be inserted into any suitable site in a capsid protein, 
such as an insertion or replacement. For example, fibers with ligands inserted 
into the knob region are exemplified. Any such ligand can be employed and a 
variety are exemplified herein. 

A variety of modified capsid proteins are exemplified herein. These 
include, but are not limited to, fibers containing: the sequence of amino acids 
set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 7.0 and 72; or a 
sequence of amino acids having 60%, 70%, 80%, 90%, 95% or greater 
sequence identity with a sequence of amino acids set forth in any of SEQ ID 
Nos. 52, 54, 56, 58, 62, 66, 70 and 72; or a sequence of amino acids encoded 
by a sequence of nucleotides that hybridizes under conditions of high stringency 
along at least 70% of its length to a sequence of nucleotides that encodes a 
sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 
70 and 72. 

Nucleic acids encoding the capsid proteins, including the fibers are also 
provided. The nucleic acids can be provided as vectors, particularly as 
adenovirus vectors. Many adenoviral vectors are known and can be modified as 
needed in accord with the description herein. Adenoviral vectors include, but 
are not limited to, early generation adenoviral vectors, such as El -deleted 
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prokaryotic cells, but typically are eukaryotic cells, including mammalian cells, 
such as primate, including human, cells. The cells can be of a specific type, such 
as a tumor cell or a cell in a particular tissue. The vectors can be oncolytic 
vector to effect killing of tumor cells. 

5 Since the modified capsid proteins herein have reduced or eliminated 

binding to HSP, viral particles containing such proteins exhibit ablated binding to 
HSP in vitro and in vivo. Thus provided is a method of reducing transduction of 
cells that express HSP, such as hepatocytes in the liver, by modifying a capsid 
protein, such as fiber to eliminate or reduce interaction with or binding to HSP. 

10 Such reduction reduces or eliminates transduction of cells that express HSP, 
including liver cells. 

Also provided are scale-up methods for the propagation of detargeted 
adenoviral particle, such as those provided herein. The method includes the 
steps of infecting or transducing a cell capable of replicating, maturing and 
15 packaging an adenoviral vector with a detargeted adenoviral vector in the 

presence of a reagent that results in entry of the adenoviral particle into the cell, 
such as a polycation and/or a bifunctional protein or other such reagent; and 
culturing the infected cell under conditions suitable for growth, spread and 
propagation of the adenoviral vector. The resulting adenoviral particles can be 
20 recovered. Polycations include, but are not limited to, hexadimethrine bromide, 
polyethylenimine, protamine sulfate and poly-L-lysine. Bifunctional proteins, 
include, but are not limited to, an anti-fiber antibody ligand fusion, an 
anti-fiber-Fab-FGF conjugate, an anti-penton-antibody ligand fusion, an 
anti-hexon antibody ligand fusion and a polylysine-peptide fusion. The ligand is 

25 selected to bind to a particular receptor. 

The viral particles that express a modified capsids provided herein can be 
produced by this method. The modification include, for example, one or more 
mutations selected from among mutations that reduce or eliminate interactions 
with one or more of a v integrins, coxsackie-adenovirus receptors (CAR) and 
30 heparin sulfate proteoglycans (HSP). Such mutations include, for example, PD1, 
KOI, KOI 2 and S*. 
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Figures 14A-14B shows the influence of fiber shaft mutations on in vivo 
adenoviral-mediated liver gene expression (Fig. 14A) and hexon DNA content 
(Fig. 14B). 

Figures 15A-15B are plasmid maps of pSQIHSPRGD (Fig. 15A) and 
5 pSQI HSPKOI RGD (Fig. 15B). 

Figure 16 shows that insertion of a RGD targeting ligand can restore 
transduction of the vectors containing the HSP binding shaft S* mutation. 

Figures 17A-17B are plasmid maps of pSQI AD35Fiber (Fig. 17A) and 
pSQI Ad35FcRGD (Fig. 17B). 

10 Figures 18A-18B are maps of plasmids encoding 35F chimeric fibers. Fig. 

18A is a plasmid map of pSQ135T5H, and Fig. 18B is a plasmid map of 
pSQI 5T35H. 

Figure 1 9 shows the results of an in vitro analysis of Ad5 vectors 
containing Ad35 fibers and derivatives thereof. 

15 Figure 20 shows the results of an in vivo analysis of Ad5 vectors 

containing Ad35 fibers and derivatives thereof. 

Figures 21A-21B are plasmid maps of pSQ1Ad41sF (Fig. 21A) and 
pSQI Ad41sFRGD (Fig. 21 B). 

Figure 22 shows the results of an in vivo analysis of Ad5 vectors 
20 containing Ad41 short fiber. 

Figure 23 shows the in vitro analysis of Ad5 based vectors containing the 
Ad41 short fiber which has been re-engineered to contain a cRGD ligand in the 
HI loop. 

Figure 24 shows enhanced transduction of AE1-2a cells with the 
25 Av3nBgFK01 detargeted adenoviral vector using hexadimethrine bromide (HB), 
protamine sulfate (PS) and poly-lysine-RGD (K14) or the anti-penton-TNFo 
bifunctional protein (apen-TNF). 

Figure 25 shows ablation of HSP interaction decreases 
adenoviral-mediated gene transfer to other organs 
3 q Figure 26 shows in vivo liver transduction with adenoviral vectors which 

encode for B-galactosidase and contain various mutations to the fiber and/or 
penton proteins. Results are plotted as percent transduction as compared to 
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As used herein, "virus," "viral particle," "vector particle," "viral vector 
particle," and "virion" are used interchangeably to refer to infectious viral 
particles that are formed when, such as when a vector containing all or a part of 
a viral genome, is transduced into an appropriate cell or cell line for the 
5 generation of such particles. The resulting viral particles have a variety of uses, 
including, but not limited to, transferring nucleic acids into cells either in vitro or 
in vivo. For purposes herein, the viruses are adenoviruses, including recombinant 
adenoviruses formed when an adenovirus vector, such as any provided herein, is 
encapsulated in an adenovirus capsid. Thus, a viral particle is a packaged viral 
10 genome. An adenovirus viral particle is the minimal structural or functional unit 
of a virus. A virus can refer to a single particle, a stock of particles or a viral 
genome. The adenovirus (Ad) particle is relatively complex and may be resolved 
into various substructures. 

Included among adenoviruses and adenoviral particles are any and all 
15 viruses that can be categorized as an adenovirus, including any adenovirus that 
infects a human or an animal, including all groups, subgroups, and serotypes. 
Thus, as used herein, "adenovirus" and "adenovirus particle refer to the virus 
itself and derivatives thereof and cover all serotypes and subtypes and naturally 
occurring and recombinant forms, except where indicated otherwise. Included 
20 are adenoviruses that infect human cells. Adenoviruses can be wildtype or can 
be modified in various ways known in the art or as disclosed herein. Such 
modifications include, but are not limited to, modifications to the adenovirus 
genome that is packaged in the particle in order to make an infectious virus. 
Exemplary modifications include deletions known in the art, such as deletions in 
25 one or more of the El a, Elb, E2a, E2b, E3, or E4 coding regions. Other 

exemplary modifications include deletions of all of the coding regions of the 
adenoviral genome. Such adenoviruses are known as gutless adenoviruses. 
The terms also include replication-conditional adenoviruses, which are viruses 
that preferentially replicate in certain types of cells or tissues but to a lesser 
30 degree or not at all in other types. For example, among the adenoviral particles 
provided herein, are adenoviral particles that replicate in abnormally proliferating 
tissue, such as solid tumors and other neoplasms. These include the viruses 
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”mask" the molecule and/or increase half-life, or conjugated to a non-viral 
protein. 

As used herein, oncolytic adenoviruses refer to adenoviruses that 
replicate selectively in tumor cells 

5 As used herein, a variety of vectors with different requirements and 

purposes are described. For example, one vector is used to deliver particular 
nucleic acid molecules into a packaging cell line for stable integration into a 
chromosome. These types of vectors also are referred to as complementing 
plasmids. A further type of vector carries or delivers nucleic acid molecules in or 
10 into a cell line (e.g., a packaging cell line) for the purpose of propagating viral 

vectors; hence, these vectors also can be referred to herein as delivery plasmids. 
A third "type" of vector is the vector that is in the form of a virus particle 
encapsulating a viral nucleic acid and that is comprised of the capsid modified as 
provided herein. Such vectors also can contain heterologous nucleic acid 
15 molecules encoding particular polypeptides, such as therapeutic polypeptides or 
regulatory proteins or regulatory sequences to specific cells or cell types in a 
subject in need of treatment. 

As used herein, the term "motif" is used to refer to any set of amino 
acids forming part of a primary sequence of a protein, either contiguous or 
20 capable of being aligned to certain positions that are invariant or conserved, that 
is associated with a particular function. The motif can occur, not only by virture 
of the primary sequence, but also as a consequence of three-dimensional folding. 
For example, the motif GXGXXG is associated with nucleotide-binding sites. In 
this fiber is a trimer, hence the trimeric structure can contribute fornation of a 
25 motif. Alternatively, a motif can be considered as a domain of a protein, where 
domain is a region of a protein molecule delimited on the basis of function 
without knowledge of and relation to the molecular substructure, as, e.g., the 
part of a protein molecule that binds to a receptor. As shown herein, the motif 
KKTK constitutes a consensus sequence for fiber shaft interaction with HSP. 

30 As used herein, the term "bind" or "binding" is used to refer to the 

binding between a ligand and its receptor, such as the binding of an Ad5 shaft 
motif with HSP (Heparin Sulfate Proteoglycans), with a K d in the range of 10-2 
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As used herein, the term "substantially eliminated" refers to a 
transduction efficiency less than about 1 1 % of the efficiency of the wild-type 
fiber containing virus on HeLa cells. The transduction efficiency on Hela cells 
can be measured (see, e.g., Example 1 of U.S. Patent Application Serial No. 

5 09/870,203 filed on 30 May 2001, and published as U.S. Published application 

No. 20020137213, and of International Patent Application No. PCT/EP01 /06286 
filed 1 June 2001). Briefly, HeLa cells are infected with the adenoviral vectors 
containing mutated fiber proteins to evaluate the effects of fiber amino acid 
mutations on CAR interaction and subsequent gene expression. Monolayers of 
10 HeLa cells in 1 2 well dishes are infected with, for example, 1 000 particles per 
cell for 2 hours at 37° C. in a total volume of, for example, 0.35 ml of the 
DMEM containing 2% FBS. The infection medium is then aspirated from the 
monolayers and I ml of complete DMEM containing 10% FBS was added per 
well. The cells are incubated for an sufficient time, generally about 24 hours, to 
15 allow for 0-galactosidase expression, which is measured by a chemiluminescence 
reporter assay and by histochemical staining with a chromogenic substrate. The 
relative levels of £-galactosidase activity are determined using as suitable 
system, such as the Galacto-Light chemiluminescence reporter assay system 
(Tropix, Bedford, Mass.) Cell monolayers are washed with PBS and processed 
20 according to the manufacturer's protocol. The cell homogenate is transferred to 
a microfuge tube and centrifuged to remove cellular debris. Total protein 
concentration is determined, such as by using the bicinchoninic acid(BCA) 
protein assay (Pierce, Inc., Rockford, III.) with bovine serum albumin as the 
assay standard. An aliquot of each sample is then incubated with the Tropix 
25 jff-galactosidase substrate for 45 minutes in a 96 well plate. A luminometer is 
used determine the relative light units (RLU) emitted per sample and then 
normalized for the amount of total protein in each sample (RLU/ug total protein). 
For the histochemical staining procedure, the cell monolayers are fixed with 
0.5% glutaraldehyde in PBS, and then were incubated with a mixture of 1 mg of 
30 5-bromo-4-chloro-3-indolyl-/?-D-galactoside (X-gal) per ml, 5 mM potassium 

ferrocyanide, 5 mM potassium ferricyanide and 2 mM MgCI 2 in 0.5 ml of PBS. 
The monolayers are washed with PBS and the blue cells are visualized by light 
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provide complementing functions for the genes deleted in an adenoviral genome 
(e.g., the nucleic acids encoding modified fiber proteins) and are able to package 
the adenoviral genomes into the adenovirus particle. The production of such 
particles require that the genome be replicated and that those proteins necessary 
5 for assembling an infectious virus are produced. The particles also can require 
certain proteins necessary for the maturation of the viral particle. Such proteins 
can be provided by the vector or by the packaging cell. 

As used herein, detargeted adenoviral particles have ablated (reduced or 
eliminated) interaction with receptors with which native particles. Fully 
10 detargeted particles have two or more specificities altered. It is understood that 
in vivo no particles are fully ablated such that they do not interact with any cells. 
Degareted and fully degarated have reduced, typically substantial! reduced, or 
eliminated interaction with native receptors. For purposes herein, detargeted 
particles have reduced (2-fold, 5-fold, 10-fold, 100-fold or more) binding or 
15 virtually no binding to HSP receptors; fully degareted vectors include further 

capsid modifications to eliminate interactions with other receptors, such as CAR 
and integrins or other receptors. The particles still bind to cells, but the types of 
cells and interactions are reduced. 

As used herein, pseudotyping describes the production of adenoviral 
20 vectors having modified capsid protein or capsid proteins from a different 

serotype than the serotype of the vector itself. One example, is the production 
of an adenovirus 5 vector particle containing an Ad37 or Ad35 fiber protein. 

This can be accomplished by producing the adenoviral vector in packaging cell 
lines expressing different fiber proteins. As provided herein, detargeting of an 
25 adenovirus 5 particle or other serotype group C adenovirus or other adenovirus 
that binds to FISP to reduce or eliminate binding to FISPs can be effected by 
replacing all or a portion that includes the shaft or at least the HSP consensus 
binding sequence of the Ad5 fiber with an adenovirus fiber or portion thereof 
that does not bind to HSP. Adenoviruses having fiber shafts that do not interact 
30 with HSP include (a) adenoviruses of subgroup B, e.g., Ad3, Ad35, Ad7, Adi 1, 
Adi 6, Ad21, Ad34 which do not have interaction with HSP, (b) adenoviruses of 
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substitution as follows: R512S, A515G, E516G, and K517G. Other KO 
mutations can be identified empirically or are known to those of skill in the art. 

As used herein, PD mutations refer to mutations in the penton gene that 
ablate binding by the encoded to a v integrin by replacing the RGD tripeptide. 

5 The PD1 mutation exemplified herein results in a substitution of amino acids 337 
through 344 of the Ad5 penton protein, HAIRGDTF (SEQ ID No. 9), with amino 
acids SRGYPYDVPDYAGTS (SEQ ID No. 10), thereby replacing the RGD 
tripeptide. 

As used herein, treatment means any manner in which the symptoms of a 
10 condition, disorder or disease are ameliorated or otherwise beneficially altered. 

As used herein, a therapeutically effective product is a product that is 
encoded by heterologous DNA that, upon introduction of the DNA into a host, a 
product is expressed that effectively ameliorates or eliminates the symptoms, 
manifestations of an inherited or acquired disease or that cures said disease. 

*15 As used herein, a subject is an animal, such as a mammal, typically a 

human, including patients. 

As used herein, genetic therapy involves the transfer of heterologous 
DNA to the certain cells, target cells, of a mammal, particularly a human, with a 
disorder or conditions for which such therapy is sought. The DNA is introduced 
20 into the selected target cells in a manner such that the heterologous DNA is 
expressed and a therapeutic product encoded thereby is produced. 

Alternatively, the heterologous DNA may in some manner mediate expression of 
DNA that encodes the therapeutic product, it may encode a product, such as a 
peptide or RNA that in some manner mediates, directly or indirectly, expression 
25 of a therapeutic product. Genetic therapy may also be used to deliver nucleic 
acid encoding a gene product to replace a defective gene or supplement a gene 
product produced by the mammal or the cell in which it is introduced. The 
introduced nucleic acid may encode a therapeutic compound, such as a growth 
factor inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a 
30 receptor therefor, that is not normally produced in the mammalian host or that is 
not produced in therapeutically effective amounts or at a therapeutically useful 
time. The heterologous DNA encoding the therapeutic product may be modified 
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program package (Devereux, J., eta/.. Nucleic Acids Research 72///:387 
(1984)), BLASTP, BLASTN, FASTA (Atschul, S.F., etaL.J Molec Biol 275:403 
(1990); Guide to Huge Computers, Martin J. Bishop, ed., Academic Press, San 
Diego, 1994, and Carillo et at. (1988) SIAM J Applied Math 45:1073). For 
5 example, the BLAST function of the National Center for Biotechnology 

Information database can be used to determine identity. Other commercially or 
publicly available programs include, DNAStar "MegAlign" program (Madison, Wl) 
and the University of Wisconsin Genetics Computer Group (UWG) Gap 
program (Madison Wl)). Percent homology or identity of proteins and/or nucleic 
10 acid molecules can be determined, for example, by comparing sequence 

information using a GAP computer program ( e.g ., Needleman et at. (1970) J. 

Mol. Biol. 48:443, as revised by Smith and Waterman ((1981) Adv. Appl. Math. 
2:482). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 
15 number of symbols in the shorter of the two sequences. Default parameters for 
the GAP program can include: (1) a unary comparison matrix (containing a value 
of 1 for identities and 0 for non-identities) and the weighted comparison matrix 
of Gribskov et at. (1986) Nuci. Acids Res. 14:6745, as described by Schwartz 
and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE. National 
20 Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for 

each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no 
penalty for end gaps. Therefore, as used herein, the term "identity" represents a 
comparison between a test and a reference polypeptide or polynucleotide. 

As used herein, the term "at least 90% identical to" refers to percent 
25 identities from 90 to 99.99 relative to the reference polypeptides. Identity at a 
level of 90% or more is indicative of the fact that, assuming for exemplification 
purposes a test and reference polynucleotide length of 100 amino acids are 
compared, no more than 10% (i.e., 10 out of 100) of amino acids in the test 
polypeptide differs from that of the reference polypeptides. Similar comparisons 
30 can be made between a test and reference polynucleotides. Such differences 
can be represented as point mutations randomly distributed over the entire 
length of an amino acid sequence or they can be clustered in one or more 
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used. Filters are incubated in hybridization mixture for 18-20 hours at 40°C, 
and then washed for 1 .5 hours at 55°C in a solution containing 2X SSC, 25 mM 
Tris-HCI (pH 7.4), 5 mM EDTA, and 0.1% SDS. The wash solution is replaced 
with fresh solution and incubated an additional 1.5 hours at 60°C. Filters are 
blotted dry and exposed for autoradiography. If necessary, filters are washed for 
a third time at 65-68°C and reexposed to film. Other conditions of low 
stringency which can be used are well known in the art ( e.g ., as employed for 
cross-species hybridizations). 

By way of example and not way of limitation, procedures using 
conditions of moderate stringency include, for example, but are not limited to, 
procedures using such conditions of moderate stringency are as follows: Filters 
containing DNA are pretreated for 6 hours at 55 °C in a solution containing 6X 
SSC, 5X Denhart's solution, 0.5% SDS and 100/yg/ml denatured salmon sperm 
DNA. Hybridizations are carried out in the same solution and 5-20 X 10 6 cpm 
32 P-labeled probe is used. Filters are incubated in hybridization mixture for 18-20 
hours at 55°C, and then washed twice for 30 minutes at 60°C in a solution 
containing IX SSC and 0.1 % SDS. Filters are blotted dry and exposed for 
autoradiography. Other conditions of moderate stringency which can be used 
are well-known in the art. Washing of filters is done at 37 °C for 1 hour in a 

solution containing 2X SSC, 0.1% SDS. 

By way of example and not way of limitation, procedures using conditions 
of high stringency are as follows: Prehybridization of filters containing DNA is 
carried out for 8 hours to overnight at 65 °C in buffer composed of 6X SSC, 

50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, 
and 500 //g/ml denatured salmon sperm DNA. Filters are hybridized for 48 hours 
at 65 °C in prehybridization mixture containing 100//g/ml denatured salmon 
sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Washing of filters is done 
at 37°C for 1 hour in a solution containing 2X SSC, 0.01 % PVP, 0.01 % Ficoll, 
and 0.01% BSA. This is followed by a wash in 0.1X SSC at 50°C for 45 
minutes before autoradiography. Other conditions of high stringency which can 
be used are well known in the art- n 
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Academic Press, Inc., Harcourt Brace Jaovanovich, NY; Nucleic Acid 
Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells 
(R. |. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL 
Press, 1986); B. Perbal (1984), A Practical Guide To Molecular Cloning) Gene 
5 Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1987, 
Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 
(Wu et al. eds.); Immunochemical Methods In Cell And Molecular Biology (Mayer 
and Walker, eds.. Academic Press, London, 1987); Handbook Of Experimental 
Immunology, Volumes l-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Hogan 
10 et at. (1986) Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. 

B. Capsid modifications 

Provided herein are modifications of the viral capsid that ablate the 
interaction of an adenovirus with its natural receptors. In particular, fiber 
15 modifications that result in ablation of the interaction of an adenvirus with HSP 
are provided. These fiber modifications can be combined with other capsid 
protein modifications, such as other fiber modifications and/or penton and/or 
hexon modifications, to fully ablate viral interactions with natural receptors, 
when expressed on a viral particle. The modification should not disrupt trimer 
20 formation or transport of fiber into the nucleus. 

1 . Fiber genes and proteins 

The fiber protein extends from the capsid and mediates viral binding to 
the cell surface by binding to specific cell receptors (Philipson et al. (1968) J. 
Virol. 2:1064-1075). The fiber is a trimeric protein that includes an N-terminal 
25 tail domain that interacts with the adenovirus penton base, a central shaft 

domain of varying length, and a C-terminal knob domain that contains the cell 
receptor binding site (Chroboczek et at. (1995) Curr.Top. Microbiol. Immunol. 
755:163-200; Riurok et al. (1 990) J. Mol. Biol. 275:589-596; Stevenson et al. 
(1995) J. Virol. 69: 2850-2857). The sequences of the fiber gene from a variety 
30 of serotypes including adenovirus serotypes 2 (Ad2), Ad5, Ad3, Ad35, Adi 2, 
Ad40, and Ad41 are known. There are at least 21 different fiber genes in 



Genbank. 
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other mutations that alter the structure of the fiber shaft such that the HSP 
binding of the modified fiber protein is ablated when compared to the HSP 

binding of the wild-type fiber protein. 

In a first aspect of this embodiment, an adenoviral fiber protein is 
5 modified by mutating one or more of the amino acids that interact with HSP. 

For example, the HSP binding motif of the modified fiber protein is no longer able 
to interact with HSP on the cell surface, thus ablating the viral interaction with 
HSP. For example, the adenoviral fiber is from a subgroup C adenovirus. 

Binding to HSP can be eliminated or reduced by mutating the fiber shaft in order 
10 to modify the ability of the HSP binding motif, which is, for example, KKTK 

sequence (SEQ ID No. 1) located between amino acid residues 91 to 94 in the 
Ad 5 fiber, to interact with HSP. The fiber proteins are modified by chemical 
and biological techniques known to those skilled in the art, such as site directed 
mutagenesis of nucleic aicd encoding the fiber or other techniques as illustrated 

15 herein. 

In another aspect of this embodiment, the ability of a fiber to interact 
with HSP is modified by replacing the wild-type fiber shaft with a fiber shaft, or 
portion thereof, of an adenovirus that does not interact with HSP to produce 
chimeric fiber proteins. The portion is sufficient to reduce or eliminate 
20 interaction with HSP. Examples of adenoviruses having fiber shafts that do not 
interact with HSP include (a) adenoviruses of subgroup B, such as, but are not 
limited to, Ad3, Ad35, Ad7, Adi 1, Adi 6, Ad21, Ad34, which do not have 
interaction with HSP, (b) adenoviruses of subgroup F, such as, but are not 
limited to, Ad40 and Ad41, specifically the short fiber, and (c) adenoviruses of 
25 subgroup D, such as but are not limited to, Ad46. In another embodiment, 
adenoviral fiber shaft modifications that ablate viral interaction with HSP in 
combination with adenoviral fiber knob modifications that ablate viral interactions 
with CAR are provided. Suitable adenoviral fiber modifications include the 
fiber knob modifications are known to those of skill in the art and are exemplified 
30 herein (see, also, US. Patent Application Serial No. 09/870,203, filed on 30 May 
2001, and published as U.S. Published application No. 20020137213, in 
International Patent Application No. PCT/EP0 1/06286 filed on 1 June 2001). 
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prepared using chemical and biological techniques known to those skilled in the 
art and as illustrated herein. Generally the adenovirus is a subgroup B or 
subgroup C adenovirus. 

Preparation of fibers modified to eliminate or reduce HSP interactions and 
fibers modified to alter interactions with other receptors and cell surface 
proteins, such as CAR and/or a v integrin, is also described in the Examples 
below. The nucleic acid and/or amino acid sequences of exemplary modified 
fibers, whose construction are described below) are set forth as SEQ ID Nos. 45- 
72 as follows: 

SEQ ID Nos. 45 and 46 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FK01, where 5F refers to 
Adenovirus 5 fiber, KOI is an exemplary mutation of the CAR interaction site 
described herein; 

SEQ ID Nos. 47 and 48 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified ber designated 5FK01RGD, which further 
includes an RGD ligand to demonstrate retargeting; 

SEQ ID Nos. 49 and 50 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FK012, where 5F refers 
to Adenovirus 5 fiber, KOI 2 is another exemplary mutation of the CAR 
interaction site described herein; 

SEQ ID Nos. 51 and 52 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5F S* nuc, where 5F 
refers to Adenovirus 5 fiber, S* is an exemplary mutation of the shaft that alters 
binding to HSP; 

SEQ ID Nos. 53 and 54 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5F S*RGD nuc, which 
further includes an RGD ligand; 

SEQ ID Nos. 55 and 56 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified ber designated 5FK01S*, which contain 
the KOI and S* mutations; 
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adenoviral vectors that express the modified capsid proteins and produce 
particles with modified fibers, or by packaging adenoviral vectors, particularly 
those that do not encode one or more capsid proteins in appropriate packaging 
lines. Hence, as discussed in detail below, adenoviral vectors and viral particles 
5 with modified fibers that do no bind to HSP are provided. 

C. Nucleic acids. Adenoviral vectors and cells containing the nucleic acids 
and cells containing the vectors 

' Also provided are polynucleotides that encode modified capsid proteins 

and that encode vectors for preparation of adenovirus that express modified 
10 capsid proteins provided herein. The sequences of the wild-type adenovirus 

proteins are well known in the art and are modified as described herein. Nucleic 
acid molecules, such as cDNA that encode an exemplary modified fiber knob for 
ablated CAR interaction (see, SEQ ID No. 2 for KOI and SEQ ID No. 3 for KOI 2) 
and for a modified penton for ablated a v integrins (SEQ ID No. 4) are provided. 

15 As discussed above, modified capsid proteins with altered tropism for CAR and 
a v integrins are known and described in the patents, applications and literature 
cited herein and known to those of skill in the art (see, e.g., U.S. Patent No. 
5,731,190, U.S. application Serial No. 09/870,203, published as U.S. Published 
application No. 20020137213; and Bai et at. (1993) J. Virology 
20 67:5198-5208). 

Also provided are vectors including the polynucleotides provided herein. 
Such vectors include partial or complete adenoviral genomes and plasmids. Such 
vectors are constructed by techniques known to those skilled in the art and as 
illustrated herein. Also provided are adenoviral vectors modified by replacing 
25 whole fiber protein, or portions thereof, with the fiber proteins, or appropriate 
portions thereof, of an adenovirus that does not interact with HSP. 

Adenoviruses that do not interact with HSP can be identified by using the 
methods described herein which detect binding or non-binding of fiber proteins 
and adenoviruses with HSP. Among the adenoviral vectors provided herein are 
30 those of subgroup C, which include Ad2 and Ad5, in which the nucleic acid 
encoding the fiber shaft or a portion including the HSP-binding portion is 
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standard techniques. An exemplary method for producing adenoviral particles 
provided herein is as follows. The nucleic acid encoding the mutated fiber 
protein is made using standard techniques in an adenoviral shuttle plasmid. This 
plasmid contains the right end of the virus, in particular from the end of the E3 
region through the right ITR. This plasmid is co-transfected into competent cells 
of an E. coli strain, such as the well known E. coli strain BJ5183 (see, e.g., 
Degryse (1996) Gene 7 70:45-50) along with a plasmid, which contains the 
remaining portion of the adenovirus genome, except for the El region and 
sometimes also the E2a region and also contains a corresponding region of 
homology. Homologous recombination between the two plasmids generates a 
full-length plasmid encoding the entire adenoviral vector genome. 

This full-length adenoviral vector genome plasmid is then transfected into 
a complementing cell line. The transfection can be performed in the presence of 
a reagent that directs adenoviral particle entry into producer cells. Such 
reagents include, but are not limited to, polycations and bifunctional reagents, 
such as those described herein. A complementing cell is, for example, is a cell 
of the PER.C6 cell line, which contains the adenoviral El gene (PER.C6 is 
available, for example, from Crucell, The Netherlands; deposited under ECACC 
accession no. 96022940; see, also Fallaux eta/. (1998) Hum. Gene Ther. 
9:1909-1907; see, also, U.S. Patent No. 5,994,128) or an AE1-2a cell (see, 
Gorziglia et al. (1996) J. Virology 70:4173-4178; and and Von Seggern et al. 

(1 998) J. Gen. Virol. 79:1461-1468)). 

AE-|-2a cells are derivatives of the A549 lung carcinoma line (ATCC # 
CCL 1 85) with chromosomal insertions of the plasmids pGRE5-2.E1 (also 
referred to as GRE5-E1-SV40-Hygro construct and listed in SEQ ID No. 41) and 
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in 
SEQ ID No. 42), which provide complementation of the adenoviral El and E2a 
functions, respectively. 

The 633 cell line (see, von Seggern et al. (2000) J. Virology 
74:354-362), which stably expresses the adenovirus serotype 5 wild-type fiber 
protein, and was derived from the AE1-2a cell line, is another an example of 
complementing cells. When the cell line is 633 cells, the final passage of 
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types, they can be amplified in cell lines derived from said cell type without 

provision of Ad complementary genes. 

2. Adenoviral vectors and particles 

The adenovirus as used herein for production of the adenoviral vectors 
5 and particles can be of any serotype. Adenoviral stocks that can be employed 
as a source of adenovirus or adenoviral coat protein, such as fiber and/or penton 
base, can be amplified from the adenoviral serotypes 1 through 47, which are 
currently available from the American Type Culture Collection (ATCC, Rockville, 
Md.), or from any other serotype of adenovirus available from any other source. 

10 For instance, an adenovirus can be of subgroup A (e.g., serotypes 12, 18, 31), 
subgroup B (e.g., serotypes 3, 7, 1 1 , 14, 1 6, 21, 34, 35), subgroup C (e.g., 
serotypes 1, 2, 5, 6), subgroup D (e.g., serotypes 8, 9, 10, 13, 15, 17, 19, 20, 
22-30, 32, 33, 36-39, 42-47), subgroup E (serotype 4), subgroup F (serotype 
40, 41), or any other adenoviral serotype. 

15 In certain embodiments, the adenovirus is a subgroup B or a subgroup C 

adenovirus. Subgroup C adenoviruses which are modified in as described herein, 
include, but are not limited to, Ad2 and Ad5. For Ad5, the mutation is made in 
the KKTK sequence (SEQ ID No. 1) located between amino acid residues 91 to 
94. The fiber proteins can be modified by chemical and biological techniques 
20 known to those skilled in the art. These methods include, but are not limited to, 
site directed mutagenesis and techniques as illustrated herein. 

The adenoviral particle generally includes a targeting ligand as described 
above. The presence of the targeting ligand permits the delivery of a gene to a 
desired cell type which is different from the cell type that wild-type adenovirus 
25 particles infect or the same as that a wild-type particle infects, but allowing the 
infection in a selective manner, /.e., non-target cell types are not significantly 

infected. 

The adenoviral vectors provided herein can be used to study cell 
transduction and gene expression in vitro or in various animal models. The latter 
30 case includes ex vivo techniques, in which cells are transduced in vitro and then 
administered to the animal. They also can be used to conduct gene therapy on 
humans or other animals. Such gene therapy can be ex vivo or in vivo. For in 
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the packaging of the adenoviral vector particle. Such gutless adenoviral vector 
particles are recovered by standard techniques. The helper vector genome can 
be delivered in the form of a plasmid or similar construct by standard 
transfection techniques, or it can be delivered through infection by a viral particle 
5 containing the genome. Such viral particle is commonly called a helper virus. 
Similarly, the gutless adenoviral vector genome can be delivered to the cell by 
transfection or viral infection. 

The helper virus genome can be the modified adenovirus vector genome 
as disclosed herein. Such genome also can be prepared or designed so that it 
10 lacks the genes encoding the adenovirus El A and El B proteins. In addition, the 
genome can further lack the adenovirus genes encoding the adenovirus E3 
proteins. Alternatively, the genes encoding such proteins can be present but 
mutated so that they do not encode functional El A, El B and E3 proteins. 
Furthermore, such vector genome can not encode other functional early proteins, 
15 such as E2A, E2B3, and E4 proteins. Alternatively, the genes encoding such 
other early proteins can be present but mutated so that they do not encode 
functional proteins. 

In producing the gutless vectors, the helper virus genome is also 
packaged, thereby producing helper virus. In order the minimize the amount of 
20 helper virus produced and maximize the amount of gutless vector particles 

produced, the packaging sequence in the helper virus genome can be deleted or 
otherwise modified so that packaging of the helper virus genome is prevented or 
limited. Since the gutless vector genome will have an unmodified packaging 

sequence, it will be preferentially packaged. 

25 One way to do this is to mutate the packaging sequence by deleting one 

or more of the nucleotides comprising the sequence or otherwise mutating the 
sequence to inactivate or hamper the packaging function. One exemplary 
approach is to engineer the helper genome so that recombinase target sites flank 
the packaging sequence and to provide a recombinase in the packaging cell. The 
30 action of recombinase on such sites results in the removal of the packaging 

sequence from the helper virus genome. The recombinase can be provided by a 
nucleotide sequence in the packaging cell that encodes the recombinase. Such 
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to p53 is unnecessary. Thus, deletion of E1b-55KD should restrict vector 

replication to p53-deficient tumor cells. 

Another approach is to use tumor-selective promoters to control the 
expression of early viral genes required for replication (see, e.g., International 
5 PCT application Nos. WO 96/17053 and WO 99/25860). Thus, in this approach 
the adenoviruses selectively replicate and lyse tumor cells if the gene that is 
essential for replication is under the control of a promoter or other transcriptional 

regulatory element that is tumor-selective. 

For example oncolytic adenoviral vectors that contain a cancer selective 
10 regulatory region operatively linked to an adenoviral gene essential for adenoviral 
replication are known (see, e.g., U.S. Patent No. 5,998,205). Adenoviral genes 
essential for replication include, but are not limited to, El a, Elb, E2a, E2b and 
E4. For example, an exemplary oncolytic adenoviral vector has a cancer 
selective regulatory region operatively linked to the El a gene. In other 
15 embodiments, the oncolytic adenoviral vector has a cancer selective regulatory 
region of the present invention operatively linked to the El a gene and a second 
cancer selective regulatory region operatively linked to the E4 gene. The vectors 
also can include at least one therapeutic transgene, such as, but not limited to, a 
polynucleotide encoding a cytokine such as GM-CSF that can stimulate a 
20 systemic immune response against tumor cells. 

Other exemplary oncolytic adenoviral vectors include those in which 
expression of an adenoviral gene, which is essential for replication, is controlled 
by E2F-responsive promoters, which are selectively transactivated in cancer 
cells. Thus, vectors that contains an adenoviral nucleic acid backbone that 
25 contains in sequential order: A left ITR, an adenoviral packaging signal, a 
termination signal sequence, an E2F responsive promoter which is operably 
linked to a first gene, such as El a, essential for replication of the recombinant 
viral vector and a right ITR (see, published International PCT application No. 
W002/06786, and U.S. Patent No. 5,998,205). 

30 In other embodiments, the oncolytic adenoviral vector has a cancer 

selective regulatory region operatively linked to the El a gene and a second 
cancer selective regulatory region operatively linked to the E4 gene. The vectors 
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and helper viruses for use with helper-dependent vectors. The packaging cell 
line has heterologous DNA stably integrated into the chromosomes of the cellular 
genome. The heterologous DNA sequence encodes one or more adenovirus 
regulatory and/or structural polypeptides that complement the genes deleted or 
5 mutated in the adenovirus vector genome to be replicated and packaged. 
Packaging cell lines express, for example, one or more adenovirus structural 
proteins, polypeptides, or fragments thereof, such as penton base, hexon. fiber, 
polypeptide Ilia, polypeptide V, polypeptide VI, polypeptide VII, polypeptide VIII, 
and biologically active fragments thereof. The expression can be constitutive or 
10 under the control of a regulable promoter. These cell lines are particularly 
designed for expression of recombinant adenoviruses intended for delivery of 
therapeutic products. For use herein, such packaging cell lines can express the 
modified capsid proteins, such as the fiber proteins who binding to HSP is 
reduced or eliminated, and/or the modified penton and hexon proteins. 

15 Particular packaging cell lines complement viral vectors having a deletion 

or mutation of a DNA sequence encoding an adenovirus structural protein, 
regulatory polypeptides El A and El B, and/or one or more of the following 
regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. 

The packaging cell lines are produced by introducing each DNA molecule 
20 into the cells and then into the genome via a separate complementing plasmid or 
plurality of DNA molecules encoding the complementing proteins can be 
introduced via a single complementing plasmid. Of interest herein, is a variation 
in which the complementing plasmid includes DNA encoding adenovirus fiber 
protein (or a chimeric or modified variant thereof), from Ad virus of subgroup D, 

25 such as Ad 37, polypeptide or fragment thereof. 

For applications, such as therapeutic applications, the delivery plasmid 
further can include a nucleotide sequence encoding a heterologous polypeptide. 
Exemplary delivery plasmids include, but are not limited to, pDV44, pAE1B/?-gal 
and pAEIspIB. In a similar or analogous manner, therapeutic nucleic acids, 

30 such as nucleic acids that encode therapeutic genes, can be introduced. 
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Application No. PCT/EP0 1/06286. For different serotypes and strains of 
adenoviruses, loci for insertion of targeting ligands can be empirically 
determined. For different serotypes and strains, such loci can vary. 

Because the adenovirus fiber has a trimeric structure, the ligand can be 
5 selected or designed to have a trimeric structure so that up to three molecules of 
the ligand are present for each mature fiber. Such ligands can be incorporated 
into the fiber protein using methods known in the art (see, e.g., U.S. Patent No. 
5,756,086). Instead of the fiber, the targeting ligand can be included in the 
penton or hexon proteins. Inclusion of targeting ligands in penton (see for 
10 example, in U.S. Patent Nos. 5,731 ,190 and 5,965 >31 ) and in hexon (see for 
example, in U.S. Patent No. 5,965,541) is known. 

In one exemplary embodiment, the ligand is included in a fiber protein, 
which is a fiber protein mutated as described herein. As shown herein, the 
targeting ligand can be included, for example, within the HI loop of the fiber 
15 protein. Any ligand that can fit in the HI loop and still provide a functional virus 
is contemplated herein. Such ligands can be as long as or longer than 80-100 
amino acids (see, e.g., Belousova eta/. (2002) J. Virol. 76:8621-8631). Such 
ligands are added by techniques known in the art (see, e.g., published 
International Patent Application publication No. W099/39734 and U.S. Patent 
20 Application number 09/482,682). Other ligands can be be discovered through 
techniques known to those skilled in the art. Some non-limiting examples of 
these techniques include phage display libraries or by screening other types of 

libraries. 

Targeting ligands include any chemical moiety that preferentially directs 
25 an adenoviral particle to a desired cell type and/or tissue. The categories of such 
ligands include, but are not limited to, peptides, polypeptides, single chain 
antibodies, and multimeric proteins. Specific ligands include the TNF superfamily 
of ligands which include tumor necrosis factors (or TNF's) such as, for example, 
TNFo and TNF/?, lymphotoxins (LT), such as LT-ct and LT-/?, Fas ligand which 
30 binds to Fas antigen; CD40 ligand, which binds to the CD40 receptor of 

B-lymphocytes; CD30 ligand, which binds to the CD30 receptor of neoplastic 
cells of Hodgkin's lymphoma; CD27 ligand, NGF ligand, and OX-40 ligand; 




WO 03/062400 



PCT/US03/02295 



-47- 



in sufficient quantities. This genetic engineering is accomplished by infecting the 
desired cell with an adenoviral particle whose genome includes a desired 
heterologous polynucleotide. The heterologous polynucleotide is then expressed 
in the genetically engineered cells. For use herein the cell is generally a 
5 mammalian cell, and is typically a primate cell, including a human cell. The cell 
can be inside the body of the animal (in vivo ) or outside the body (in vitro). 
Heterologous polynucleotides (also referred to as heterologous nucleic acid 
sequences) are included in the adenoviral genome within the particle and are 
added to that genome by techniques known in the art. Any heterologous 
10 polynucleotide of interest can be added, such as those disclosed in U.S. Patent 
No. 5,998,205, incorporated herein by reference. Polynucleotides that are 

introduced into an Ad genome or vector can be any that encode a protein of 
interest or that are regulatory sequences. Proteins include, but are not limited 
to, therapeutic proteins, such as an immunostimulating protein, such as an 
15 interleukin, interferon, or colony stimulating factor, such as granulocyte 
macrophage colony stimulating factor (GM-CSF; see, e.g., 5,908,763F. 

Generally, such GM-CSF is a primate GM-CSF, including human GM-CSF. Other 
immunostimulatory genes include, but are not limited to, genes that encode 
cytokines IL1, IL2, IL4, IL5, IFN, IFN, TNF, IL1 2, IL18, and flt3), proteins that 
20 stimulate interactions with immune cells (B7, CD28, MHC class I, MHC class II, 
TAPs), tumor-associated antigens (immunogenic sequences from MART-1 , 
gp 100 (pmel- 17 ), tyrosinase, tyrosinase-related protein 1 , tyrosinase-related 
protein 2, melanocyte-stimulating hormone receptor, MAGE1 , MAGE2, MAGE3, 
MAGE12, BAGE, GAGE, NY-ESO-1, -catenin, MUM-1, CDK-4, caspase 8, KIA 
25 0205, HLA-A2R1 701 , -fetoprotein, telomerase catalytic protein, G-250, MUC-1, 

carcinoembryonic protein, p53, Her2/neu, triosephosphate isomerase, CDC-27, 
LDLR-FUT, telomerase reverse transcriptase, and PSMA), cDNAs of antibodies ^ 
that block inhibitory signals (CTLA4 blockade), chemokines (MIP1, M1P3, CCR7 

ligand, and calreticulin), and other proteins. 

30 Other polynucleotides, including therapeutic nucleic acids, such as 

therapeutic genes, of interest include, but are not limited to, anti-angiogenic, and 
suicide genes. Anti-angiogenic genes include, but are not limited to, genes that 
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November 19, 1997, which published as PCT Publication No. WO/9925860), 

Nos, FasL, and sFasR (soluble Fas receptor). 

Also contemplated are combinations of two or more transgenes with 
synergistic, complementary and/or nonoverlapping toxicities and methods of 
5 action. The resulting adenovirus can retain the viral oncolytic functions and, for 
example, additionally are endowed with the ability to induce immune and 
anti-angiogenic responses and other responses as desired. 

Therapeutic polynucleotides and heterologous polynucleotides also 
include those that exert an effect at the level of RNA or protein. These include 
10 include a factor capable of initiating apoptosis, RNA, such as RNAi and other 

double-stranded RNA, antisense and ribozymes, which among other capabilities 
can be directed to mRNAs encoding proteins essential for proliferation, such as 
structural proteins, transcription factors, polymerases, genes encoding cytotoxic 
proteins, genes that encode an engineered cytoplasmic variant of a nuclease 
15 (e.g. RNase A) or protease (e.g. trypsin, papain, proteinase K and carboxy- 

peptidase). Other polynucleotides include a cell or tissue specific promoters, 
such as those used in oncolytic adenoviruses (see, e.g., U.S. Patent No. 
5,998,205). 

The heterologous polynucleotide encoding a polypeptide also can contain 
20 a promoter operably linked to the coding region. Generally the promoter is a 
regulated promoter and transcription factor expression system, such as the 
published tetracycline-regulated systems, or other regulatable systems (WO 
01/30843), to allow regulated expression of the encoded polypeptide. 

Exemplary of other promoters, are tissue-selective promoters, such as those 
25 described in U.S. Patent No. 5,998,205. An exemplary regulatable promoter 
system is the Tet-On( and Tet-Off( systems currently available from Clontech 
(Palo Alto, CA). This promoter system allows the regulated expression of the 
transgene controlled by tetracycline or tetracycline derivatives, such as 
doxycycline. This system can be used to control the expression of the encoded 
30 polypeptide in the viral particles and nucleic acids provided herein. Other 
regulatable promoter systems are known (see, e.g., published U.S. No. 
20020168714, entitled "Regulation of Gene Expression Using Single-Chain, 
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U.S. Patent No. 5,998,205, U.S. Patent No. 5,801,029; U.S. patent application 
60/348,670 and corresponding published International PCT application No. 
W002/06786). These include the cytolytic, cytopathic viruses (or vectors), 

including the oncolytic viruses discussed above. 

5 Alternatively, as discussed above, the vector can include a mutation or 

deletion in the Elb gene. Typically such mutation or deletion in the Elb gene is 
such that the E1b-19kD protein becomes non-functional. This modification of 
the Elb region can be combined with vectors where all or a part of the E3 region 

is present. 

10 The oncolytic adenoviral vector can further include at least one 

heterologous coding sequence, such as one that encodes a therapeutic product. 
The heterologous coding sequence, such as therapeutic gene, is generally, 
although not necessarily, in the form of cDNA, and can be inserted at any locus 
that does not adversely affect the infectivity or replication of the vector. For 
15 example, it can be inserted in an E3 region in place of at least one of the 

polynucleotide sequences that encode an E3 protein, such as, for example, the 
1 9kD or 14.7 kD E3 gene. 

F. Propagation and Scale-up 

Since doubly ablated adenoviral vectors containing mutations in the fiber 
20 and/or penton capsid proteins result in inefficient cell binding and entry via the 
CAR Icn integrin entry pathway, scaled up technologies improve the growth and 
propagation of such vectors to produce high titers of the adenoviral vectors for 
clinical use. Thus, also provided is a method for scaling up the production of 
detargeted adenoviral vectors. The detargeted adenoviral vectors comprise an 
25 adenoviral vector modified to ablate the interaction of said vector with at least 
one host cell receptor compared with a wild-type adenoviral vector. The 
detargeted adenoviral vectors can comprise an adenoviral vector modified to 
ablate the interaction of said vector with one, two, three or more host cell 
receptors. Thus, the method is suitable for producing the detargeted adenoviral 

30 vectors disclosed herein. 

As noted, growth and propagation of doubly and fully ablated adenoviral 
vectors is enhanced by new scale up technologies. Doubly ablated vectors 
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Reagents which are useful in this method are those that are capable of 
directing adenoviral particle entry into the producer cells. Such reagents include, 
but are not limited to, polycations and bifunctional reagents. Suitable 
polycations include, but are not limited to, polytheylenimine; protamine sulfate; 

5 poly-L-lysine hydrobromide; poly{dimethyl diallyl ammonium) chloride 

(Merquat(r)-IOO, Merquat(r)280, Merquat(r)550); poly-L-arginine hydrochloride; 
poly-L-histidine; poly(4-vinylpyridine), poly(4-vinylpyridine) hydrochloride; 
poly( 4 -vinylpyridine)cross-linked, methylchloride quaternary salt; poly(4-vmyl- 
• pyridine-co-styrene); poly(4-vinylpyridinium poly(hydrogen fluoride)); poly(4-vinyl- 
10 pyridinium-P-toluene sulfonate); poly( 4 -vinylpyridinium-tribromide); poly(4-vinyl- 
pyrrolidone-co-2-dimethylamino-ethyl methacrylate); polyvinylpyrrolidone, 
cross-linked; poly vinylpyrrolidone, poly(melamine-co-formaldehyde); partially 
methylated; hexadimethrine bromide; poly(Glu, Lys) 1 :4 hydrobromide; poly(Lys, 
Ala) 3:1 hydrobromide; poly(Lys, Ala) 2:1 hydrobromide; poly-L-lysine 
15 succinylated; poly(Lys, Ala) 1:1 hydrobromide; and poly(Lys, Trp) 1:4 
hydrobromide. 

Suitable bifunctional reagents include, but are not limited to, antibodies or 
peptides that bind to the adenoviral capsid and that also contain a ligand that 
allows interaction with specific cell surface receptors of the producer cells. 

20 Examples of bifunctional reagents include: (a) anti-fiber antibody ligand fusions, 
(b) anti-fiber-Fab-FGF conjugate, (c) anti-penton-antibody ligand fusions, (d) 
anti-hexon antibody ligand fusions and (e) polylysine-peptide fusions. The ligand 
is any ligand that will bind to any cell surface receptor found on the producer 

cells. 

25 

The following examples are included for illustrative purposes only and are 
not intended to limit the scope of the invention. 
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substitution of fiber amino acids 408 and 409, changing them from serine and 

proline to glutamic acid and alanine, respectively. 

The vector was constructed as follows. First, the plasmid pSKOI (Figure 
1) was digested with the restriction enzymes Sphl and Muni. The resulting DNA 
5 fragments were separated by electrophoresis on an agarose gel. The 1 601 bp 
fragment containing all but the 5' end of the fiber gene was excised from the 
agarose gel and the DNA was isolated and purified. The fragment was then 
ligated with the 9236 bp fragment of P 5FloxHRFRGD, which had been digested 
with Sphl and Muni. The resulting plasmid, p5FloxHRFK01 , was digested with 
10, Spel and Pad and the 6867 bp fragment containing the fiber gene was isolated. 
The fragment was ligated with the 24,630 bp Spel-Pacl fragment of pNDSQ3.1 . 
The resulting plasmid, pNDSQ3.1K01 (Figure 2), was used together with 
pAdmireRSVnBg (Figure 3A) to generate a plasmid which encodes the full-length 
adenoviral vector genome. It, however, was necessary to remove the Pad site 
15 from pNDSQ3.1 KOI (Figure 2) prior to recombination with pAdmireRSVnBg 

(Figure 3A) so that the final plasmid contains a unique Pad site adjacent to the 
5' 1TR. The Pad site in pNDSQ3.1K01 was removed by digestion with Pad 
followed by blunting with T4 DNA Polymerase and religation. The resulting 
plasmid was called pNDSQ3.1 KOI (Pac. 

20 To generate a full-length plasmid containing the entire adenoviral genome, 

pAdmireRSVnBg (Figure 3A) was digested with Sail and co-transfected into 
competent cells of the E. coli strain BJ5183 along with pNDSQ3.1 KOI APac, 
which had been digested with BstBI. Homologous recombination between the 
two plasmids generated a full-length plasmid encoding the entire adenoviral 

25 vector genome, which was called pFLAv3nBgFK01 . 

The plasmid pFLAv3nBgK01 was linearized with Pad and transfected into 

633 cells. In the fiber complementing 633 cell line, the resulting viral DNA 
containing the KOI mutation is capable of being packaged into infectious viral 
particles containing a mixture of wildtype fiber and mutant fiber proteins. After 
30 five rounds of amplification in 633 cells, a cytopathic effect was observed. 

Three more rounds of amplification in 633 cells were performed followed by 
purification of the virus by standard CsCI centrifugation procedures. This viral 
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was digested with Xbal and EcoRI and cloned into pSQI using a three-way 
ligation to generate pSQ1K012 (Figure 30. The KOI 2 cDNA was incorporated 
into the genome of AvI nBg, an adenovirus vector with El and E3 deleted 
encoding £-galactosidase, by homologous recombination between Clal-linearized 
5 p sQ1 KOI 2 and pAdmireRSVnBg digested with Sail and Pad to generate 
Avl nBgKOI 2. The KOI 2 vector was transfected in 633 cells, scaled-up on 
non-fiber expressing cells and purified, as described above for KOI. 

AvInBgPDI 

Genetic incorporation of the PD1 penton mutation to generate 
10 AvInBgPDI 

The adenoviral vector AvInBgPDI is an E1-, E3-deleted vector based on 
the adenovirus serotype 5 genome. It contains a RSV promoted 
nuclear-localizing £-galactosidase gene in the El region and also contains the 
PD1 mutation in the penton gene. The PD1 mutation results in a substitution of 
15 amino acids 337 through 344 of the penton protein, HAIRGDTF (SEQ ID No. 9), 
with amino acids SRGYPYDVPDYAGTS (SEQ ID No. 10), thus replacing the RGD 
tripeptide (see, Einfeld et at. (2001 ) J. Virology 75:11284-11291). The 
mutation in the penton gene was generated in the plasmid pGEMpen5, which 
contains the Adenovirus serotype 5 penton gene. To generate the mutation, 

20 four oligonucleotides were synthesized. The sequences of the oligonucleotides 
were as follows: penton 1 : 5' CGC GGA AGA GAA CTC CAA CGC GGC AGC 
CQC GGC AAT GCA GCC GGT GGA GGA CAT GAA 3' (SEQ ID No. 11); penton 
2: 5' TAT CGT TCA TGT CCT CCA CCG GCT GCA TTG CCG CGG CTG CCG 
CGT TGG AGT TCT CTT CC 3' (SEQ ID No. 12); penton 3: 5' CGA TAG CCG 
25 CGG CTA CCC CTA CGA CGT GCC CGA CTA CGC GGG CAC CAG CGC CAC 
ACG GGC TGA GGA GAA GCG CGC 3' (SEQ ID No. 13); penton 4: 5' TCA GCG 
CGC TTC TCC TCA GCC CGT GTG GCG CTG GTG CCC GCG TAG TCG GGC 
ACG TCG TAG GGG TAG CCG CGG C 3' (SEQ ID No. 14). The complementary 
oligonucleotides penton 1 and penton 2 were annealed to each other as were 
30 penton 3 and penton 4. The duplex generated by annealing penton 3 and penton 
4 encoded the substitution of amino acids 337 through 344 described above. 
The duplex generated by annealing penton 1 and penton 2 possessed a 5 base 
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AvInBgFKOlPDI 

Genetic incorporation of the fiber KOI or K012 mutation in combination 
with the penton PD1 mutation to generate AvInBgFKOlPDI 

The adenoviral vectors AvI nBgFKOI PD1 and AvInBgKOI 2PD1 were 

generated in an E1-, E3-deleted adenovirus serotype 5 genome. Both vectors 

contains a RSV promoted nuclear-localizing 0-galactosidase gene in the El region 

and also contains either the KOI or KOI 2 mutation in the fiber gene as well as 

the PD1 mutation in the penton gene. The vectors were constructed as follows. 

First, the plasmid pSQlPDI was digested with Csp45l and Spel and the 23976 

bp fragment containing the PD1 mutated penton gene was purified. In addition, 

the plasmids pSQIKOI or pSQ1K012 (Figure 3B) were digested with Csp45l 

and Spel and the 9090 bp fragment containing the KOI or KOI 2 mutated fiber 

gene were purified. The appropriate purified fragments were ligated to each 

other to from the plasmid pSQIFKOlPDI (Figure 5A) or pSQ1K012PD1 (Figure 

5B) that contains the KOI (or KOI 2) mutated fiber gene and the PD1 mutated 

penton gene. To generate virus, pSQIFKOlPDI or P SQK012PD1 was linearized 

with Clal and co-transfected into 633 cells with pAdmireRSVnBg (Figure 3A) 

which had been digested with Sail and Pad. After three rounds of amplification 

in 633 cells a cytopathic effect was observed and the crude viral lysate was 

then amplified on PerC6 cells. Hexadimethrine bromide was maintained in the 

medium at 4 fjglm\. Each virus was purified by standard CsCI centrifugation 

procedures. 

EXAMPLE 2 

In Vitro Evaluation of Adenoviral Vectors Containing the KOI and PD1 Mutations 
Several recombinant adenoviral vectors were used in these studies to 
demonstrate the function of the KOI fiber mutation and included AvInBg, 

AvI nBgFKOI , AvInBgPDI , and AvI nBgFKOI PD1 , described above. The 
transduction efficiencies of adenoviral vectors containing the KOI and/or PD1 
mutations were evaluated on cells of the alveolar epithelial cell line A549. The 
transduction efficiencies were compared to that of AvInBg, an adenoviral vector 
containing wild type fiber and penton. 
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kidney were collected from each animal. The median lobe of the liver was placed 
in neutral buffered formalin to preserve the sample for /?-galactosidase 
immunohistochemistry. In addition, tissue from each organ was frozen to 
preserve it for hexon PCR analysis to determine vector content. A separate 
5 sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent /?-galactosidase activity assay. 

For /?-galactosidase immunohistochemistry slices of liver, approximately 
2-3 mm thick, were placed in 10% neutral buffered formalin. After fixation, 
these samples were embedded in paraffin, sectioned, and analyzed by 
10 immunohistochemistry for jS-galactosidase expression. A 1 : 1 200 dilution was 
used of a rabbit anti-yff-galactosidase antibody (ICN Pharmaceuticals, Inc.; Costa 
Mesa, CA) in conjunction with a Vectastain ABC kit (Vector Laboratories, Inc., 
Burlingame, CA) to visualize positive cells. 

The chemiluminescent £-galactosidase activity assay was performed 
15 using the Galacto-Light PlusTM chemiluminescent assay (Tropix, Inc., Foster 
City, CA) system. Tissue samples were collected in lysis matrix tubes 
containing two ceramic spheres (Biol 01, Carlsbad, CA) and frozen on dry ice. 

The tissues were thawed and 500 jj\ of lysis buffer from the Galacto-Light Plus 
kit was added to each tube. The tissue was homogenized for 30 seconds using a 
20 FastPrep System (Biol 01, Carlsbad, CA). Liver samples were homogenized for 
an additional 30 seconds. £-galactosidase activity was determined in the liver 
homogenates according to the manufacture's protocol. 

For hexon PCR analysis DNA from tissues was isolated using the Qiagen 
Blood and Cell Culture DNA Midi or Mini Kits (Qiagen Inc., Chatsworth, CA). 

25 Frozen tissues were partially thawed and minced using sterile disposable 

scalpels. Tissues were then lysed by incubation overnight at 55° C in Qiagen 
buffer G2 containing 0.2 mg/ml RNaseA and 0.1 mg/ml protease. Lysates were 
vortexed briefly and then applied to Qiagen-tip 100 or Qiagen-tip 25 columns. 
Columns were washed and DNAs were eluted as described in the manufacturer's 
30 instructions. After precipitation, DNAs were dissolved in water and the 

concentrations were spectrophotometrically determined (A260 and A280) on a 
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transduction compared to AvInBg, suggesting that integrins are involved to 
some extent in hepatic uptake of the adenoviral vectors. 

The results of the immunohistochemical staining of liver sections for 
/?-galactosidase were consistent with the activity assays (data not shown) and 
demonstrate that gene expression was localized specifically to hepatocytes. The 
vectors containing the KOI or KOI 2 mutation alone showed a slight increase in 
liver transduction as revealed by a more intense and frequent 
immunohistochemical-staining pattern. The vectors containing the PD1 
mutation, either alone or combined with KOI or KOI 2, showed little difference 
in transduction compared to AvInBg. These results demonstrate that 
ablating the viral interaction with CAR and/or integrins is not sufficient to fully 
detarget adenoviral vectors from the liver in vivo. 

In summary, the fiber AB loop mutation contained in AvInBgFKOI or 
Av1nBgKO!2 ablates interaction with human and mouse CAR in vitro and 
diminished transduction in vitro. In vivo, however, fiber AB loop mutations 
behaved unexpectantly, because such mutations were found to enhance 
adenoviral-mediated gene transfer to liver and results in increasing vector 
potency. The penton base, PD1 mutation that ablates interaction with the 
second receptor involved in adenoviral internalization had no effect in vitro and 
little to no effect in vivo. These studies indicated that other receptors are 
responsible for adenoviral gene transfer to the liver in vivo. 

EXAMPLE 4 

Description Of Adenoviral Vectors Containing A Fiber With Amino Acid 
Substitutions At The Heparin Sulfate Binding Domain In The Fiber Shaft 

Vectors containing substitutions at all four of the amino acids in the four 
amino acid motif in the Ad5 fiber shaft (residues 91 to 94, KKTK; SEQ ID No. 1) 
were generated in order to ablate the potential interaction with HSP. The 
mutation is termed HSP because it potentially eliminates binding to heparan 
sulfate proteoglycans. Vectors containing the HSP mutation alone and combined 
with the KOI mutation (fiber knob AB loop mutation that ablates CAR binding), 
the PD1 mutation (penton mutation that eliminates RGD/integrin interaction), and 
a triple knockout vector (HSP, KOI, PD1) were generated. 
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To generate a recombinant adenoviral vector containing the HSP mutation 
in the fiber gene, pSQIHSP was digested with Clal and pAdmireRSVnBg (Figure 
3A) was digested with Sail and Pad, then the two digested plasmids were 
co-transfected into 633 cells (von Seggern et al. (2000) J Virology 74\ 354-362). 

5 Homologous recombination between the two plasmids generated a full-length 
adenoviral genome capable of replication in 633 cells, which inducibly express 
Ad5 El A and constitutively express wild-type fiber protein. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP mutation, 

10 the viral preparation was used to infect PerC6 cells, which do not express fiber. 
The resulting virus, termed AvInBgFS*, was purified by standard CsCI 
centrifugation procedures. 

Generation of vector containing the HSP and KOI mutations 
To generate an adenoviral vector containing the HSP and KOI mutations 
15 in fiber, a PCR SOEing strategy identical to the one described above was used 
except that the plasmid pSQIFKOI was used as the template. The PCR SOEing 
product was digested with Xbal and Muni and ligated with the 6477 bp Xbal to 
Muni fragment of pFBshuttle(EcoRl) to generate pFBSEHSPKOI. The fiber gene 
containing the HSP and KOI mutations was transferred from pFBSEHSPKOI into 
20 the pSQI backbone using a three-way ligation strategy identical to the one 
described above for the HSP mutation alone, to generate the plasmid 
pSQIHSPKOI (Figure 10). Recombinant adenoviral vector containing the HSP 
and KOI mutations in the fiber gene was generated by co-transfecting 
pSQIHSPKOI digested with Clal and pAdmireRSVnBg digested with Sail and 
25 Pad into 633 cells. Adenovirus was propagated and purified as described above 
for the vector containing the HSP mutation alone. The resulting virus was 
termed AvI nBgFKOI S*. 
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5000, 10,000. HeLa cells were transduced with each of the above vectors, as 
well as a vector containing the KOI mutation alone (AvInBgFKOI ) and a vector 
containing the PD1 mutation alone (AvInBgPDI) at 2000 PPC. The cell 
monolayers were stained with X-gal 24 hours after infection and the percentage 
5 of cells expressing jff-galactosidase was determined by microscopic observation 
and counting of cells. Transductions were done in triplicate and three random 
fields in each well were counted, for a total of nine fields per vector. 

The results (depicted in Figures 13A-13B) showed significantly reduced 
transduction efficiencies on A549 and HeLa cells using vectors containing the 
10 HSP mutation compared to AvInBg. The vectors containing the HSP mutations, 
however, demonstrated a dose response on A549 cells, in that increasing PPC 
ratios yielded increasing transduction. 

Competition experiments were done to determine which receptor molecular 
interactions are involved in transduction of A549 cells by the various vectors. 

15 Transductions were performed in the presence or absence of various competitors 
including Ad5 fiber knob, a 50 amino acid oligopeptide derived from Adenovirus 
serotype 2 penton base which spans the RGD tripeptide region, or heparin 
(Invitrogen Life Technologies, Gaithersburg, MD). Monolayers of A549 cells 
were cultured in Richters medium supplemented with 10% FBS and were 
20 transduced with AvInBg, AvInBgS*, AvInBgFKOI S*, Av1nBgS*PD1, or 

AvI nBgFK01S*PD1 in infection medium (IM, Richters medium plus 2% FBS). 
Different PPC ratios were used for the different vectors to achieve measurable 
transduction levels. The PPC ratios were as follows: AvI nBg: 500 PPC, 
AvInBgS*: 10,000 PPC, AvI nBgFKOI S*: 20,000 PPC, AvI nBgS*PD1 : 10,000 
25 PPC, and AvI nBgFKOI S*PD1 : 20,000 PPC. Fiber knob competition was 

performed by pre-incubating cells in IM containing 1 6 of fiber knob for 1 0 

minutes at room temperature prior to infection with virus. Penton base peptide 
competition was performed by pre-incubating cells in IM containing 500nM 
peptide for 10 minutes at room temperature prior to infection with virus. , 

30 Heparin competition was performed by pre-incubating each adenoviral vector in 
IM containing 3 mg/ml of heparin for 20 minutes at room temperature. In all 
cases, the competitor remained in the IM during the 1 hour infection when virus 
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separate sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent /?-galactosidase activity assay. /?-galactosidase 
immunohistochemistry, hexon real-time PCR and the chemiluminescent 
/?-galactosidase activity assay were carried out as described in Example 3. 

The results of the /?-galactosidase activity assay (Figure 14A) and adenoviral 
hexon DNA content (Figure 14B) showed a dramatic reduction in liver 
transduction by vectors containing the HSP mutation. The vectors containing 
the HSP mutation alone resulted in reducing adenoviral-mediated liver gene 
expression by approximately 20-fold .When combined with the KOI mutation 
(HSP, KOI, PD1), yielded approximately a 1000-fold reduction in /?-galactosidase 
activity in the liver compared to the control vector AvInBg. The vector 
containing the KOI mutation alone showed a slight increase, on average, in liver 
transduction compared to AvInBg, which is consistent with several previous 
experiments. The vectors containing the PD1 mutation alone or combined with 
KOI showed a slight decrease in liver transduction compared to AvInBg, 
although the decrease was not statistically significant. Analysis of hepatic 
adenoviral hexon DNA content (Figure 1 4B) confirmed these results. 

The results of the immunohistochemical staining of liver sections for /?- 
galactosidase were consistent with the activity assays (data not shown) and 
demonstrated that gene expression was localized specifically to hepatocytes. 
Vectors containing the HSP mutation, either alone or in combination with KOI 
and/or PD1, showed a dramatic reduction in hepatocyte transduction. The 
vector containing the KOI mutation alone showed a slight increase in liver 
transduction as revealed by a more intense and frequent immunohistochemical 
staining pattern. The vectors containing the PD1 mutation, either alone or 
combined with KOI, showed little difference in transduction compared to 
AvInBg. 

EXAMPLE 7 

Description of Adenoviral Vectors Containing the HSP Fiber Shaft Mutation with 
and without the KOI Fiber Mutation and with and without a cRGD Targeting 
Ligand in the Fiber Knob HI Loop 
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described in Example 1 above, was digested with BstXI and Kpnl and the 4549 
bp and 3156 bp fragments were isolated and purified. The three fragments were 
ligated to generate the plasmid pFBSEHSPKOI RGD, which encodes a fiber 
containing the HSP mutation, the KOI mutation, and cRGD in the HI loop. The 
5 fiber gene from this plasmid was transferred into the pSQI backbone as follows. 
The plasmid pFBSEHSPKPOl RGD was digested with EcoRl and Xbal and the 
7601 bp fragment was isolated and purified. The plasmid pSQI (Figure 3B) was 
digested with the restriction enzymes EcoRl, Ndel, and Xbal and the 16,431 bp 
EcoRl to Ndel fragment and the 9043 bp Ndel to Xbal fragment were isolated 
10 and purified. The three DNA fragments were ligated to generate the plasmid 
pSQI HSPKOI RGD (Figure 15B). 

To generate a recombinant adenoviral vector containing the HSP and KOI 
mutations in the fiber gene along with a cRGD ligand in the HI loop, the plasmid 
pSQI HSPKOI RGD was digested with Clal and co-transfected into 633 cells with 
15 pAdmireRSVnBg which had been digested with Sail and Pad. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP and KOI 
mutations and a cRGD ligand, the viral preparation was used to infect PerC 6 
cells, which do not express fiber. The resulting virus, termed 
20 AvI nBgFK01S*RGD, was purified by standard CsCI centrifugation procedures. 

EXAMPLE 8 

In Vitro Evaluation of Adenoviral Vectors Containing the HSP Fiber Shaft 
Mutation with or without the Fiber Knob KOI Mutation and with or without a 
cRGD Ligand in the HI Loop 

25 The transduction efficiencies of adenoviral vectors containing the HSP 

fiber shaft mutation with or without the fiber KOI mutation and with or without 
the cRGD ligand in the HI loop were evaluated on A549 cells. The transduction 
efficiencies were compared to that of AvInBg, an adenoviral vector containing 
wild type fiber. The day prior to infection, cells were seeded into 24-well plates 
30 at a density of approximately 1 x 10 5 cells per well. Immediately prior to 
infection, the exact number of cells per well was determined by counting a 
representative well of cells. Each of the vectors, AvInBg, AvInBgS , 
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corresponds to bp 25,308 through 25,334 of pSQI . The DNA sequence of 
P-0006/L was as follows: 5' TCT TGG TCA TCT GCA ACA ACA TGA AGA TAG 
TG 3' (SEQ ID No. 18). It contains a 10 base pair 5' extension that is 
complementary to the start of the Ad35 fiber gene, while the remainder of the 
5 primer anneals to the sequence immediately 5' of the ATG start codon of the 
fiber gene in pSQI . A PCR product of the expected size, 583 bp, was obtained 
and the DNA was gel purified. A second PCR amplified the Ad35 fiber gene 
using DNA extracted from wildtype Ad35 virus as a template. The primers used 
for this reaction were P-0007/U and 35FMun. The DNA sequence of P-0007/U 
10 was as follows: 5' GT TGT TGC AG ATG ACC AAG AGA GTC CGG CTC A 3' 

(SEQ ID No. 19). It contains a 10 base pair 5' extension that is homologous to 
the 10 bp immediately prior to the ATG start codon of the fiber gene in Ad5. 

The remainder of the primer anneals to the start of the Ad35 fiber gene. The 
DNA sequence of 35FMun was as follows: 5' AG CAA TTG AAA AAT AAA 
1 5 CAC GTT GAA ACA TAA CAC AAA CGA TTC TTT A GTT GTC GTC TTC TGT 
AAT GTA AGA A 3' (SEQ ID No. 20). It contains a 46 base pair 5' extension 
that is complementary to the region of the Ad5 genome between the end of fiber 
and the Muni site 40 bp downstream of the fiber gene. In addition, the 5' 
extension encodes the last amino acid and stop codon of the Ad5 fiber gene. 

20 This region was retained in the vector because it contains the polyadenylation 
site for the fiber gene. The remainder of the primer anneals to the 3' end of the 
Ad35 fiber gene, up to the next to last amino acid codon. A PCR product of the 
expected size, 1027 bp, was obtained and the DNA was gel purified. The two 
PCR products were mixed and joined together by PCR SOEing using primers 
25 P-0005/U and P-0009. The DNA sequence of P-0009 was as follows: 5' AG 

CAA TTG AAA AAT AAA CAC GTT G 3' (SEQ ID No. 21 ). It corresponds to bp 
27,648 through 27,669 of pSQI and overlaps the Muni site in that region. A 
PCR product of the expected size, 1 590 bp, was obtained and gel purified. It 
was cloned into the plasmid pCR4blunt-T0P0 (Invitrogen Corporation, Carlsbad 
30 CA) using the Zero Blunt TOPO PCR Cloning Kit from Invitrogen. This 

intermediate cloning step simplified DNA sequencing of the PCR SOEing product. 
The resulting plasmid, termed P TOPOAd35F, was digested with Xbal and Muni 
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together. The DNA fragment generated in the second PCR was digested with 
Xbal and Muni and was cloned directly into pFBshuttle(EcoR1 ) to create the 
fiber shuttle plasmid pFBshuttle5TS35H. 

TABLE 2 

5 Primers Used For The Exchange Of Fiber Shaft Regions Between Ad5 And Ad35 

Fibers 



Primer 

designation 


Sequence 


SEQ ID 


pi 


5'-GAACAGGAGGTGAGCTTAGA-3' 


22 


P2 


5'-GTTAGGTGGAGGGTTTATTCCGGTCCAC 
AAAGTTAG CTTATC-3' 


23 


P3 


5'-GATAAGCTAACTTTGTGGACCGGAATAAA 

CCCT CC ACCT AAC-3' 


24 


P4 


5'-GTGGCAGGTTGAATACTAGG-3 


25 


P5 


5'-GTTAGGAGATGGAGCTGGTGTAGTCCATA 

AGGTGTTAATAC-3' 


26 


P6 


5'-GT ATT AAC ACCTT ATGG ACT ACACC AG CT 
CCATCTCCT AAC-3' 


27 


P7 


5'-T G CG C AAA AACAAT C ACC ACG ACAAT CACAAT 
GT AC ATT G G A AG AA AT CAT ACG -3' 


28 


P8 


5'-ACATTGTGATTGTCGTGGTGATT 
GTTTTTG CG C AT ATG CC AT AC AATTTG A ATG-3 


29 



For the construction of the 35TS5H chimera, the pFBshuttleAd35 plasmid 
was used as the template with the PI and P5 primers to generate the 5 
20 fragment. The 3' fragment was generated using the pFBshuttle(EcoR1 ) plasmid 
as the template with the P6 and P4 primers. Following the same procedure 
described above, the fiber shuttle plasmid pFBshuttle35TS5H was generated. 

For the 35TS5H and 5TS35H chimeras, the fiber gene was transferred 
from the pFBshuttle(EcoRI) backbone into pSQI as described above for the 
25 vector containing the Ad35 fiber. The resulting plasmids were called 

pSQ135T5H (Figure 18A) and pSQ15T35H (Figure 18B). In addition, adenoviral 
vectors were generated using the co-transfection strategy described above. 

Construction of Ad5 vectors containing the Ad35 fiber with a cRGD 
targeting peptide in the HI loop of the 35F fiber knob: To incorporate the cRGD 
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Av1nBg35T5H vector behaves similarly to the AvInBgS* vector in vitro and in 
vivo. These studies also demonstrate that vectors containing fiber proteins 
without an HSP binding site are fully viable. 

EXAMPLE 1 1 

5 in Vivo Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
Thereof 

The objective of this study was to evaluate the in vivo biodistribution of 
adenoviral vectors containing 35F fibers and derivatives thereof to determine 
whether vectors containing these fibers ablate liver transduction due to their 
10 shaft regions. A positive control cohort received AvInBg and a negative control 
group received HBSS. Cohorts of five C57BL/6 mice received each vector via 
tail vein injection at a dose of 1 x 10 13 particles per kg. The animals were 
sacrificed approximately 72 hours after vector administration by carbon dioxide 
asphyxiation. Liver, heart, lung, spleen, and kidney were collected from each 
15 animal. The median lobe of the liver was placed in neutral buffered formalin to 
preserve the sample for //-galactosidase immunohistochemistry. In addition, 
tissue from each organ was frozen to preserve it for hexon PCR analysis to 
determine vector content. A separate sample of liver from each mouse was 
frozen to preserve it for a chemiluminescent //-galactosidase activity assay. //- 
20 galactosidase immunohistochemistry, hexon real-time PCR and the 

chemiluminescent //-galactosidase activity assay were carried out as described in 
example 3. 

The results of the //-galactosidase activity assay showed a dramatic 
reduction in liver transduction by vectors containing the Ad35 fiber or the 
25 35T5H derivative (Figure 20) with an approximately 4- to 24-fold reduction in //- 

galactosidase activity in the liver compared to the control vector AvInBg. These 
data demonstrate that shaft domains without HSP binding sites can effectively 
ablate hepatic in vivo gene transfer. In particular, HSP is the major entry 
mechanism for liver in vivo. CAR binding is a minor entry pathway. 
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5' extension corresponding to the sequence in pSQI from the last codon of the 
fiber gene through the Muni site 40 bp downstream of the fiber gene. The 
remainder of the primer anneals to the 3' end of the Ad41 s fiber gene in 
pDV60Ad41 sF. The PCR product was the expected size (1219 bp). The two 
5 PCR products were joined by PCR SOEing using primers P-0005/U and P-0009/L. 
The DNA sequence of P-0009/L was described above. The PCR SOEing reaction 
yielded the expected 1 782 bp product. The product was cloned into 
pCR4blunt-TOPO to yield pCR4blunt-T0P0Ad41sF. Next, 
pCR4blunt-T0P0Ad41sF was digested with Xbal and Muni and the 1773 bp 
10 fragment containing the Ad41s fiber gene was gel purified. This fragment was 
ligated with the 6477 bp Xbal to Muni fragment of pFBshuttle(EcoRI) to generate 
pFBshuttle Ad4 1 sF. The Ad41 s fiber gene was transferred into the pSQI 
backbone as follows. First, pFBshuttle Ad41 sF was linearized using EcoRI and 
this fragment was ligated with the 24,213 bp EcoRI fragment of pSQI to 
15 generate pSQ1Ad41sF (Figure 21 A). Adenoviral vector containing the Ad41s 
fiber was generated using the co-transfection strategy described above. 

Construction of Ad5 adenoviral vectors containing the Ad41 short fiber 
with a cRGD targeting ligand in the HI loop: A PCR SOEing strategy was used to 
generate a construct containing the Ad41s fiber with cRGD in the HI loop. The 
20' plasmid pFBshuttleAd41 sF was used as a template for the PCR amplifications. 
First, a 1782 bp fragment was amplified using primers 5FF and 41sRGDR. The 
primer 5FF was described above. It anneals to pFBshuttleAd41 sF at the Xbal 
site upstream of the fiber gene. The DNA sequence of the primer 41sRGDR was 
as follows: 5' AGT ACA AAA ACA ATC ACC ACG ACA ATC ACA GTT TAT 
25 CTC GTT GTA GAC GAC ACT GA 3' SEQ ID No. 34). It contains a 30 bp 5' 

extension that encodes the cRGD targeting ligand. The remainder of the primer 
anneals to pFBshuttleAd41 sF from bp 2878 through 2903. A second PCR 
amplified a 277bp region of pFBshuttleAd41 sF using primers 3FR and 41sRGDF. 
The primer 3FR was described previously. It anneals to pFBshuttle Ad41 sF at the 
30 Muni site downstream of the fiber gene. The DNA sequence of 41sRGDF was 
as follows: 5' TGT GAT TGT CGT GGT GAT TGT TTT TGT ACT AGT GGG TAT 
GCTTTT ACT TTT 3' (SEQ ID No. 35). It contains a 30 bp 5' extension that 
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chemiluminescent /?-galactosidase activity assay was carried out as described in 
example 3. 

The results of the hexon DNA analysis showed a dramatic reduction in 
liver transduction by vectors containing the Ad41 sF fiber (Figure 22) with an 
5 approximately a 5-fold reduction in liver adenoviral DNA content compared to 
either control vector. 

In the above examples, several novel adenoviral vectors were generated 
containing various fiber modifications designed to ablate the normal tropism of 
the vector. See Table 3. Vectors were generated in which the heparan sulfate 
10 binding domain in the fiber shaft was replaced by amino acid substitutions. This 
mutation, termed HSP, was also combined with the KOI mutation (fiber knob AB 
loop mutation that ablates CAR binding), and the PD1 mutation (penton mutation 
that eliminates RGD/integrin interaction). In addition, a vector containing all three 
mutations (HSP, KOI, PD1 ) was generated. All vectors containing the HSP 
15 mutation, either alone or combined with other capsid modifications, showed 
dramatically reduced transduction efficiencies on A549 and HeLa cells. 
Furthermore, the same vectors showed dramatically reduced transduction of the 
liver following systemic delivery to mice. As an alternative strategy to ablate the 
normal tropism of Ad5-based vectors, the Ad5 fiber was replaced by a fiber from 
20 a different adenovirus serotype which does not bind CAR and does not contain 
the heparan binding domain in the shaft. Thus, vectors were generated 
containing the Ad35 fiber and the Ad41 short fiber. Versions of these two 
vectors containing a cRGD targeting ligand in the HI loop of the fiber were also 
produced. Additionally, vectors containing chimeric fibers were generated. A 
25 vector containing the Ad35 fiber tail and shaft regions fused to the Ad5 fiber 

knob domain as well as a vector containing the Ad5 fiber tail and shaft fused to 
the Ad35 fiber knob domain were constructed. Vectors containing either the 
entire Ad35 or Ad41 short fiber showed a significant reduction in liver 
transduction following delivery to mice via the tail vein. The observation of 
30 reduced liver transduction using vectors containing either an HSP mutation, the 
Ad35 fiber, or the Ad41 short fiber indicates the feasibility of detargeting 
adenoviral vectors in vivo, in vitro data with the Ad35 fiber or the Ad41 short 
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Vector 


Description 


AvI nBg41 sFRGD 


The same as AvInBg but containing the full length Ad41 short 
fiber cDNA with a cRGD ligand in the HI loop of the Ad41 
short fiber 



EXAMPLE 14 

In Vitro Evaluation Of Adenoviral Vectors Containing The Ad41sF With A cRGD 
5 Ligand In The HI Loop 

The transduction efficiencies of adenoviral vectors containing the Ad41 sF 
fiber with the cRGD ligand in the HI loop were evaluated on A549 cells. The 
transduction efficiencies were compared to that of AvInBg, an adenoviral vector 
containing wild type fiber or AvI nBgFKOI RGD, an adenoviral vector containing 
10 the KOI mutation in combination with the cRGD ligand in the HI loop. The day 
prior to infection, cells were seeded into 24-well plates at a density of 
approximately 1 x 10 5 cells per well. Immediately prior to infection, the exact 
number of cells per well was determined by counting a representative well of 
cells. Each of the vectors, AvInBg, AvI nBgFKOI RGD, and AvI nBg41 sFRGD 
15 were used to transduce A549 cells at a particle to cell ratios of O up to 10,000. 
The cell monolayers were stained with X-gal 24 hours after infection and the 
percentage of cells expressing /?-galactosidase was determined by microscopic 
observation and counting of cells. Transductions were done in triplicate and 
three random fields in each well were counted, for a total of nine fields per 
20 vector. The results {Figure 23) show that the AvI nBg41 sFRGD vector 

transduced cells to an equivalent level as AvInBgFKOI RGD at all vector doses 
examined. Neither FK01 or Ad41sF can bind CAR. The Ad41sF does not 
normally interact with CAR and additionally does not contain the HSP binding 
motif within the shaft domain. These data show that targeting peptides inserted 
25 into the loop regions of the fiber knob of KOI and Ad41sF allows for 

transduction of target cells via the targeted receptor. Surprisingly, HSP, not 
CAR and integrins, is the major entry route in vivo and ablation of HSP binding 
permits targeting of adenoviral vectors. 
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significantly different than the AvInBg ( + ) control using a unpaired, t-test 
analysis, P value ( 0.024. When expressed as a percent of the AvInBg control 
values, the influence of each mutation, individually or in combination, becomes 
apparent. The S* mutation dramatically reduced gene transfer to all four organs, 

5 whereas, the KOI mutation did not. Thus, the importance of the shaft for 

transduction in vivo extends to organs besides the liver. Finally, gene transfer to 
the lung, heart, and kidney was diminished with PD1 suggesting a role for 
integrin binding in vector entry in these organs. 

EXAMPLE 16 

10 Retargeting the S*, shaft modification and the 41 sF fiber in vivo 

Vectors containing the HSP mutation have been shown to effectively 
detarget adenoviral vectors in vivo (see examples 6 and 15). The objective of 
this study was to evaluate the ability to retarget vectors containing the S* 
modification or the Ad41sF to tumors in vivo. A cRGD peptide was genetically 
15 incorporated into the fiber HI loop and evaluated in vitro (Examples 8 and 14). 

These same vectors were then evaluated in vivo in tumor-bearing mice. Athymic 
nu/nu female mice were injected with 8 x 10 6 A549 cells on the right hind flank. 
When tumors reached approximately 100mm3 in size, they were randomized 
into treatment groups. Cohorts of 6 mice received each vector via tail vein 
20 injection at a dose of 1 x 10 13 particles per kg. The animals were sacrificed 
approximately 72 hours after vector administration by carbon dioxide 
asphyxiation. Tumor, liver, heart, lung, spleen, and kidney were collected from 
each animal. Tissue from each organ was frozen to preserve it for real time PCR 
analysis to determine adenoviral hexon DNA content. Hexon real-time PCR was 
25 carried out as described in example 3. A separate sample of liver from each 

mouse was frozen to preserve it for a chemiluminescent yS-galactosidase activity 
assay. Hexon real-time PCR and the chemiluminescent yS-galactosidase activity 
assay was carried out as described in example 3. 

The adenoviral vector biodistribution to the liver and tumor for each 
30 treatment group is shown in Figure 27. Vectors containing the S*, KOIS*, and 
41 sF fibers effectively detargeted the liver and tumor resulting in a significant 
reduction in the amount of adenoviral DNA found in each tissue in comparison to 
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prior to infection. The infection was carried out for 2 hrs. Complete medium 
containing hexadimethrine bromide at 4 ^/g/ml was added to each plate. Final 
concentration of hexadimethrine bromide in all of these experiments was 
maintained at 4 ji/g/ml. The titers were determined spectrophotometrically using 
5 the conversion of 1 0D at A260nm per 1 x 1 0 12 particles (Mittereder et at. 

(1996) J Virology 70: 7498-7509). The total particle yield was then normalized 

for the number of plates used for transduction. 

The inclusion of hexadimethrine bromide in the medium during the course 
of infection allows for the efficient propagation of detargeted adenoviral vectors 
10 containing fiber and penton mutations either alone or in combination. The affect 
of hexadimethrine bromide on vector yields is shown in Table 4. A 35-fold 
improvement in the yield of Av3nBgFK01 was found when hexadimethrine 
bromide was included in the culture medium and resulted in increasing the vector 
yield from 1 .3 x 10 10 up to 4.6 x 10 11 vector particle per plate. Hexadimethrine 
15 bromide has a minimal effect on the yield of the AvI nBgPDI adenoviral vector 
containing the penton, PD1 mutation with only a 1.2 fold improvement. The 
greatest effect using hexadimethrine bromide was found on the propagation of 
the doubly ablated adenoviral vector, AvI nBgFKOI PD1 with increases in vector 
yield from barely detectable levels up to 4.53 x 10 10 vector particles per plate. 

20 These data demonstrate that use of nonspecific entry mechanisms allows for the 
efficient scale-up of detargeted adenoviral vectors. 

TABLE 4 

Efficient Scale-Up Of Detargeted Adenoviral Vectors Using hexadimethrine 



bromide 



Vector 


Vector Yield (particles/plate) 


Fold 

Improvement 


{-) hexadimethrine 
bromide 


( + ) hexadimethrine 
bromide 


AvInBg 


3.89 x 10 11 


5.72 x 10 11 


1.47 


Av3nBg 


8.58 x 10 10 


2.38 x 10 11 


2.77 


Av3nBgFK01 


1.30 x 10 10 


4.60 x 10 11 


35.4 


AvInBgPDI 


1.95 x 10 11 


2.40 xIO" 


1.23 


AvI nBgFKOI PD1 


TLTC* 


4.53 x 10 10 


t 
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10 ppc - 
anti-penton 
TNF 


10 ppc + 
anti-penton 
TNF 


100 ppc - 
anti-penton TNF 


1 00 ppc + 
anti-penton TNF 




Percentage of CPE 




48 h 


20-30% 


20-30% 


90-100% 


90-100% 


72 h 


60-70% 


80-90% 


100% 


100% 


120 h 1 


100% 


100% 


100% 


100% 


AvSnBgl 


COI 24hrs 




24 h 


0% 


0% 


0% 


0% 


48 h 


0% 


10-20% 


0% 


90-100% 


72 h 


5% 


60-70% 


5% 


100% 


120 h 


40-50% 


100% 


100% 


100% 



10 

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 























WO 03/062400 



PCT/US03/02295 



-91- 

the portion comprises a sufficient portion to alter HSP binding of 
the resulting protein. 

1 1 . The modified protein of claim 10, wherein the binding to HSP of 
the modified fiber protein is eliminated or reduced compared to the unmodified 

5 protein. 

12. The modified protein of claim 10, wherein the remainder of the fiber 
protein is from the second adenovirus. 

13. The modified protein of any of claims 2, 3, 10 and 1 1, further 
comprising one or more further modifications that reduce or eliminate interaction 

10 of the resulting fiber with one or more cell surface proteins in addition to HSP. 

14. The modified protein of claim 13, further comprising a ligand, 
whereby the resulting fiber binds to a receptor for the ligand. 

1 5. The modified protein of claim 14, wherein the ligand is included in 
the knob region. 

15 16. The modified protein of claim 14, wherein the ligand is inserted or 

it replaces a portion of the fiber, whereby the resulting fiber binds to a receptor 
for the ligand. 

17. A modified protein of claim 1 1, wherein affinity for HSP is reduced 
at least by an amount selected from among reduced 5-fold, 10-fold and 100- 

20 fold. 

18. The modified protein of claim 11, wherein the first adenovirus is 
selected from the group consisting of subgroup B, D or F, and the second is of 
subgroup C. 

19. The modified protein of claim 10, wherein the first adenovirus is 

25 selected from the group consisting of Ad3, Ad35, Ad7, Adi 1, Adi 6, Ad21, 

Ad34, Ad40, Ad41 and Ad46. 

20. The modified protein of claim 18, wherein the second adenovirus 
is Ad5 or Ad2. 

21. The modified protein of claim 19, wherein the second adenovirus 

30 is Ad 5 or Ad2. 

22. A modified protein of claim 1 selected from the group consisting of 
a fiber protein comprising: 
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37. An adenoviral particle, comprising a modified protein of any of 
claims 1-12 and 14-22, whereby binding of the viral particle to HSP is altered 
compared to a particle that expresses an unmodified fiber. 

38. An adenoviral particle, comprising a modified protein of claim 13, 
whereby binding of the viral particle to HSP is altered compared to a particle that 
expresses an unmodified fiber. 

39. An adenoviral particle of claim 37, wherein a native receptor for 

the fiber is coxsackie-adenovirus receptor (CAR). 

40. The adenoviral particle of claim 39, further comprising a mutation 

in the CAR-binding region of the capsid. 

41 . The adenoviral particle of claim 39, further comprising a mutation 
in the a v integrin-binding region of the capsid, whereby binding to the integrin is 
eliminated or reduced. 

42. The adenoviral particle of claim 40, further comprising a mutation 
in the a v integrin-binding region of the capsid, whereby binding to the integrin is 
eliminated or reduced 

43. The adenoviral particle of claim 39, wherein the CAR-binding 

region of the capsid modified is on a fiber knob. 

44. The adenoviral particle of claim 43, wherein the fiber knob 

modification is in the AB loop or CD loop. 

45. The adenoviral particle of claim 44, wherein the fiber knob 
modification is selected from the group consisting of KOI and KOI 2. 

46. The adenoviral particle of claim 39, wherein the adenovirus is a 

subgroup C, D or F adenovirus. 

47. The adenoviral particle of claim 46, wherein the subgroup C virus 
is Ad2 or Ad5, the subgroup D virus is Ad46 and the subgroup F virus is Ad41 . 

48. The adenoviral vector of claim 27 that is an early generation 
adenoviral vector, a gutless adenoviral vector or a replication-conditional 
adenoviral vector. 

49. The adenoviral vector of claim 28 that is an early generation 
adenoviral vector, a gutless adenoviral vector or a replication-conditional 
adenoviral vector. 
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the adenoviral vector is an oncolytic vector; and 

the cell is killed . 

66. The method of claim 64-, wherein the cell is a mammalian cell. 

67. The method of claim 64, wherein the cell is a primate cell. 

5 68. The method of claim 67, wherein the cell is a human cell. 

69. A method of reducing transduction of liver cells by an adenoviral 
particle, comprising reducing or eliminating binding of the particle to heparin 
sulfate proteoglycans (HSPs) on the liver cells. 

70. A scale up method for the propagation of a detargeted adenoviral 

10 particle, comprising: 

infecting a cell capable of replicating, maturing and packaging an 
adenoviral vector with a detargeted adenoviral vector in the presence of a 
reagent that results in entry of the adenoviral particle into the cell; 

culturing the infected cell under conditions suitable for growth, spread 
1 5 and propagation of the adenoviral vector; and 

recovering the resulting adenoviral particles. 

71. The method of claim 70, wherein the reagent is a polycation. 

72. The method of claim 71 , wherein the polycation is selected from 
the group consisting of hexadimethrine bromide, polyethylenimine, protamine 

20 sulfate and poly-L-lysine. 

73. The method of claim 70, wherein the reagent is a bifunctional 
protein that binds to the adenoviral particle and to a receptor on the cell. 

74. The method of claim 73, wherein: 

the bifunctional protein is selected from the group consisting of an 
25 anti-fiber antibody ligand fusion, an anti-fiber-Fab-FGF conjugate, an 

anti-penton-antibody ligand fusion, an anti-hexon antibody ligand fusion and a 
polylysine-peptide fusion, wherein the ligand is a ligand that binds to the 
receptor. 

75. The method of any one of claims 70-74, wherein the detargeted 
30 adenoviral particle expresses a modified capsid, whereby binding to at least one 

host cell receptor is reduced or eliminated compared with a wild-type 



adenovirus. 
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Figure 2 
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Figure 4 
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Figure 5A 
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Figure 5B 
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Figure 6 
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Figure 10 
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Figure 11 
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Figure 12 
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Figure 13A 




Figure 13B 
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Figure 15A 
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Figure 15B 
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Figure 18A 
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Figure 21A 
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Figure 21B 
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SEQUENCE LISTING 

<110> The Scripps Research Institute 
Novartis AG 
Kaleko, Michael 
Nemerow, Glen R. 

Smith, Theodore 
Stevenson, Susan C. 

<120> Fiber Shaft Modifications for Efficient Targeting 



<130> 22908- 123 6PC 

<140 > Not yet assigned 
<141> Herewith 

<150 > 60/350,388 
<151> 2002-01-24 

<150> 60/391,967 
<151> 2002-06-26 

<160> 72 

<170> Patentln version 3.0 

< 210 > 1 
< 211 > 4 

<212 > PRT 

<213 > adenovirus serotype 5 

<400> 1 

Lys Lys Thr Lys 
1 

< 210 > 2 
<211> 1746 

<212> DNA 

<213 > adenovirus serotype 5 
<400 > 2 

atgaagcgcg caagaccgtc tgaagatacc 
accggtcctc caactgtgcc ttttcttact 
gagagtcccc ctggggtact ctctttgcgc 
atgcttgcgc tcaaaatggg caacggcctc 
caaaatgtaa ccactgtgag cccacctctc 
atatctgcac ccctcacagt tacctcagaa 
atggtcgcgg gcaacacact caccatgcaa 
aaacttagca ttgccaccca aggacccctc 
acatcaggcc ccctcaccac caccgatagc 
ctaactactg ccactggtag cttgggcatt 
ggaaaactag gactaaagta cggggctcct 
accgtagcaa ctggtccagg tgtgactatt 
ggagccttgg gttttgattc acaaggcaat 
attgattctc aaaacagacg ccttatactt 
caactaaatc taagactagg acagggccct 
attaactaca acaaaggcct ttacttgttt 
gttaacctaa gcactgccaa ggggttgatg 
ggagatgggc ttgaatttgg ttcacctaat 
attggccatg gcctagaatt tgattcaaac 
cttagttttg acagcacagg tgccattaca 
ttgtggacca caccagctcc agaggctaac 



ttcaaccccg tgtatccata tgacacggaa 
cctccctttg tatcccccaa tgggtttcaa 
ctatccgaac ctctagttac ctccaatggc 
tctctggacg aggccggcaa ccttacctcc 
aaaaaaacca agtcaaacat aaacctggaa 
gccctaactg tggctgccgc cgcacctcta 
tcacaggccc cgctaaccgt gcacgactcc 
acagtgtcag aaggaaagct agccctgcaa 
agtaccctta ctatcactgc ctcaccccct 
gacttgaaag agcccattta tacacaaaat 
ttgcatgtaa cagacgacct aaacactttg 
aataatactt ccttgcaaac taaagttact 
atgcaactta atgtagcagg aggactaagg 
gatgttagtt atccgtttga tgctcaaaac 
ctttttataa actcagccca caacttggat 
acagcttcaa acaattccaa aaagcttgag 
tttgacgcta cagccatagc cattaatgca 
gcaccaaaca caaatcccct caaaacaaaa 
aaggctatgg ttcctaaact aggaactggc 
gtaggaaaca aaaataatga taagctaact 
tgtagactaa atgcagagaa agatgctaaa 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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ctgaaatacg agtgggtgga gttcacgctg cccgagggca actactccga gaccatgacc 540 

atagacctta tgaacaacgc gatcgtggag cactacttga aagtgggcag acagaacggg 600 

gttctggaaa gcgacatcgg ggtaaagttt gacacccgca acttcagact ggggtttgac 660 

cccgtcactg gtcttgtcat gcctggggta tatacaaacg aagccttcca tccagacatc 720 

attttgctgc caggatgcgg ggtggacttc acccacagcc gcctgagcaa cttgttgggc 780 

atccgcaagc. ggcaaccctt ccaggagggc tttaggatca cctacgatga tctggagggt 840 

ggtaacattc ccgcactgtt ggatgtggac gcctaccagg cgagcttgaa agatgacacc 900 

gaacagggcg ggggtggcgc aggcggcagc aacagcagtg gcagcggcgc ggaagagaac 960 

tccaacgcgg cagccgcggc aatgcagccg gtggaggaca tgaacgatag ccgcggctac 1020 

ccctacgacg tgcccgacta cgcgggcacc agcgccacac gggctgagga gaagcgcgct 1080 

gaggccgaag cagcggccga agctgccgcc cccgctgcgc aacccgaggt cgagaagcct 1140 

cagaagaaac cggtgatcaa acccctgaca gaggacagca agaaacgcag ttacaaccta 1200 

ataagcaatg acagcacctt cacccagtac cgcagctggt accttgcata caactacggc 1260 

gaccctcaga ccggaatccg ctcatggacc ctgctttgca ctcctgacgt aacctgcggc 1320 

tcggagcagg tctactggtc gttgccagac atgatgcaag accccgtgac cttccgctcc 1380 

acgcgccaga tcagcaactt tccggtggtg ggcgccgagc tgttgcccgt gcactccaag 1440 

agcttctaca acgaccaggc cgtctactcc caactcatcc- gccagtttac ctctctgacc 1500 
cacgtgttca atcgctttcc cgagaaccag attttggcgc gcccgccagc ccccaccatc 1560 

accaccgtca gtgaaaacgt tcctgctctc acagatcacg ggacgctacc gctgcgcaac 1620 

agcatcggag gagtccagcg agtgaccatt actgacgcca gacgccgcac ctgcccctac 1680 

gtttacaagg ccctgggcat agtctcgccg cgcgtcctat cgagccgcac tttttga 1737 

<210> 5 

< 211 > 20 
<212> DNA 

<213 > adenovirus serotype 5 
<400> 5 

gaacaggagg tgagcttaga 20 

< 210 > 6 
<211> 43 

<212> DNA 

<213 > adenovirus serotype 5 
<40 0> 6 

tccgcctcca tttagtgaac agttaggaga tggagctggt gtg 43 

<210> 7 

<211> 44 

<212 > DNA 

<213> adenovirus serotype 5 
<40 0> 7 

tcactaaatg gaggcggaga tgctaaactc actttggtct taac 44 

<210> 8 
<211> 20 
<212 > DNA 

<213 > adenovirus serotype 5 
<40 0> 8 

gtggcaggtt gaatactagg 20 

<210> 9 

< 211 > 8 
<212> PRT 

<213 > adenovirus serotype 5 
<400> 9 

His Ala lie Arg Gly Asp Thr Phe 
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<400> 15 

ggctccggct ccgagaggtg ggctcacagt ggttacattt 

<210> 16 
<211> 32 

<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > oligonucleotide for mutation generation 
<400> 16 

ggagccggag cctcaaacat aaacctggaa at 
<210> 17 

<211> 27 

<212 > DNA 

<213> Artificial Sequence 
< 220 > 

<223 > amplification primer 
<400> 17 

ctctagaaat ggacggaatt attacag 

<210> 18 
<211> 32 

<212 > DNA 

<213> Artificial Sequence 
< 220 > 

<223 > amplification primer 
<400> 18 

tcttggtcat ctgcaacaac atgaagatag tg 

<210> 19 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
< 220 > 

<223 > amplification primer 
<400> 19 

gttgttgcag atgaccaaga gagtccggct ca 

< 210 > 20 
<211> 73 

<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > amplification primer 
<400> 20 

agcaattgaa aaataaacac gttgaaacat aacacaaacg attctttagt tgtcgtcttc 
tgtaatgtaa gaa 

< 210 > 21 
<211> 24 

<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223> amplification primer 



40 



32 



27 



32 



32 



60 

73 



<400> 



21 
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<210> 


28 


<211> 


54 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 

<223> 


amplification primer 


<4 0 0 > 


28 



tgcgcaaaaa caatcaccac gacaatcaca atgtacattg gaagaaatca tacg 

<210> 29 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
< 220 > 

<223> amplification primer 
<400> 29 

acattgtgat tgtcgtggtg attgtttttg cgcatatgcc atacaatttg aatg 

<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
< 220 > 

<22 3 > RGD targeting peptide 
<400> 30 

His Cys Asp Cys Arg Gly Asp Cys Phe Cys 
1 5 10 

<210> 31 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
< 220 > 

<223> amplification primer 
<400> 31 

ttcttttcat ctgcaacaac atgaagatag tg 

<210> 32 

<211> 32 

<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > amplification primer 
<400> 32 

gttgttgcag atgaaaagaa ccagaattga ag 

<210> 33 

<211> 73 

<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > amplification primer 

<40 0> 33 .. 

tgcaattgaa aaataaacac gttgaaacat aacacaaacg attctttatt cttcagttat 

gtagcaaaat aca 



54 



54 



32 



32 



60 

73 
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<213> Artificial Sequence 
< 220 > 

<223> amplification primer 
<400> 40 

ttacatgcac atctcgggcc aggac 25 

<210> 41 
<211> 7607 
<212 > DNA 

<213> Artificial Sequence 
< 220 > 

<223 > Plasmid GRE5-El-SV40-Hygro 
<400> 41 

tctagaagat ccgctgtaca ggatgttcta gctactttat tagatccgct gtacaggatg 60 

ttctagctac tttattagat ccgctgtaca ggatgttcta gctactttat tagatccgct 120 

gtacaggatg ttctagctac tttattagat ccgtgtacag gatgttctag ctactttatt 180 

agatcgatct cctggccgtt cggggtcaaa aaccaggttt ggctataaaa gggggtgggg 240 

gcgcgttcgt cctcactctc ttccgcatcg ctgtctgcga gggccaggat cgatcctgag 300 

aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat tgtaaaattc 360 

atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag atgtcccttg 420 

tatcaccatg gaccctcatg ataattttgt ttctttcact ttctactctg ttgacaacca 480 

ttgtctcctc ttattttctt ttcattttct gtaacttttt cgttaaactt tagcttgcat 540 

ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta ctttctctaa 600 

tcactttttt ttcaaggcaa tcagggtata ttatattgta cttcagcaca gttttagaga 660 

acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc tggctggcgt 720 

ggaaatattc ttattggtag aaacaactac atcctggtca tcatcctgcc tttctcttta 780 

tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc caaaccgggc 840 
ccctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 900 
gttattgtgc tgtctcatca ttttggcaaa gaattagatc taagcttctg cagctcgagg 960 
actcggtcga ctgaaaatga gacatattat ctgccacgga ggtgttatta ccgaagaaat 1020 
ggccgccagt cttttggacc agctgatcga agaggtactg gctgataatc ttccacctcc 1080 
tagccatttt gaaccaccta cccttcacga actgtatgat ttagacgtga cggcccccga 1140 
agatcccaac gaggaggcgg tttcgcagat ttttcccgac tctgtaatgt tggcggtgca 1200^ 
ggaagggatt gacttactca cttttccgcc ggcgcccggt tctccggagc cgcctcacct 1260 
ttcccggcag cccgagcagc cggagcagag agccttgggt ccggtttcta tgccaaacct 1320 
tgtaccggag gtgatcgatc ttacctgcca cgaggctggc tttccaccca gtgacgacga 1380 
ggatgaagag ggtgaggagt ttgtgttaga ttatgtggag caccccgggc acggttgcag 1440 
gtcttgtcat tatcaccgga ggaatacggg ggacccagat attatgtgtt cgctttgcta 1500 
tatgaggacc tgtggcatgt ttgtctacag taagtgaaaa ttatgggcag tgggtgatag 1560 
agtggtgggt ttggtgtggt aatttttttt ttaattttta cagttttgtg gtttaaagaa 1620 
ttttgtattg tgattttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 1680 
gaaccggagc ctgcaagacc tacccgccgt cctaaaatgg cgcctgctat cctgagacgc 1740 
ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct 1800 
tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1860 
gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagcct 1920 
gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 1980 
cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 2040 
ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc 2100 
cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 2160 
tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt 2220 
ctgtggggct catcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 2280 
tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 2340 
gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 2400 
gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc 2460 
ggggggtacc tgctggattt tctggccatg catctgtgga gagcggttgt gagacacaag 2520 
aatcgcctgc tactgttgtc ttccgtccgc ccggcgataa taccgacgga ggagcagcag 2580 
cagcagcagg aggaagccag gcggcggcgg caggagcaga gcccatggaa cccgagagcc 2640 
ggcctggacc ctcgggaatg aatgttgtac aggtggctga actgtatcca gaactgagac 2700 
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ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 6420 
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 6480 
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 6540 
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 6600 
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc 6660 
attattatca tgacattaac ctataaaaat aggcgtatca cgaggcccct ttcgtctcgc 6720 
gcgtttcggt gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc 6780 
ttgtctgtaa gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg 6840 
cgggtgtcgg ggctggctta actatgcggc atcagagcag attgtactga gagtgcacca 6900 
tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa taccgcatca ggaaattgta 6960 
agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac 7020 
caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga gatagggttg 7080 
agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc caacgtcaaa 7140 
gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc ctaatcaagt 7200 
tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag cccccgattt 7260 
agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa agcgaaagga 7320 
gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac cacacccgcc 7380 
gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg caactgttgg 7440 
gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct 7500 
gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg 7560 
gccagtgaat tgtaatacga ctcactatag ggcgaattaa ttcgggg 7607 

<210> 42 
<211> 11600 
<212> DHA 

<213> Artificial Sequence 
< 220 > 

<223> Plasmid MMTV-E2a-SV40 -Neo 
<400> 42 

gaattccgca ttgcagagat attgtattta agtgcctagc tcgatacaat aaacgccatt 60 

tgaccattca ccacattggt gtgcacctcc aagcttgggc agaaatggtt gaactcccga 120 

gagtgtccta cacctagggg agaagcagcc aaggggttgt ttcccaccaa ggacgacccg 180 

tctgcgcaca aacggatgag cccatcagac aaagacatat tcattctctg ctgcaaactt 240 

ggcatagctc tgctttgcct ggggctattg ggggaagttg cggttcgtgc tcgcagggct 300 

ctcacccttg actcttttaa tagctcttct gtgcaagatt acaatctaaa caattcggag 360 

aactcgacct tcctcctgag gcaaggacca cagccaactt cctcttacaa gccgcatcga 420 

ttttgtcctt cagaaataga aataagaatg cttgctaaaa attatatttt taccaataag 480 

accaatccaa taggtagatt attagttact atgttaagaa atgaatcatt atcttttagt 540 

actattttta ctcaaattca gaagttagaa atgggaatag aaaatagaaa gagacgctca 600 
acctcaattg aagaacaggt gcaaggacta ttgaccacag gcctagaagt aaaaaaggga 660 
aaaaagagtg tttttgtcaa aataggagac aggtggtggc aaccagggac ttatagggga 720 
ccttacatct acagaccaac agatgccccc ttaccatata caggaagata tgacttaaat 780 
tgggataggt gggttacagt caatggctat aaagtgttat atagatccct cccttttcgt 840 
gaaagactcg ccagagctag acctccttgg tgtatgttgt ctcaagaaga aaaagacgac 900 
atgaaacaac aggtacatga ttatatttat ctaggaacag gaatgcactt ttggggaaag 960 
attttccata ccaaggaggg gacagtggct ggactaatag aacattattc tgcaaaaact 1020 
catggcatga gttattatga atagccttta ttggcccaac cttgcggttc ccagggctta 1080 
agtaagtttt tggttacaaa ctgttcttaa aacgaggatg tgagacaagt ggtttcctga 1140 
cttggtttgg tatcaaaggt tctgatctga gctctgagtg ttctattttc ctatgttctt 1200 
ttggaattta tccaaatctt atgtaaatgc ttatgtaaac caagatataa aagagtgctg 1260 
attttttgag taaacttgca acagtcctaa cattcacctc ttgtgtgttt gtgtctgttc 1320 
gccatcccgt ctccgctcgt cacttatcct tcactttcca gagggtcccc ccgcagaccc 1380 
cggcgaccct caggtcggcc gactgcggca gctggcgccc gaacagggac cctcggataa 1440 
gtgacccttg tctctatttc tactatttgg tgtttgtctt gtattgtctc tttcttgtct 1500 
ggctatcatc acaagagcgg aacggactca ccatagggac caagctagcg cttctcgtcg 1560 
cgtccaagac cctcaaagat ttttggcact tcgttgagcg aggcgatatc aggtatgaca 1620 
gcgccctgcc gcaaggccag ctgcttgtcc gctcggctgc ggttggcacg gcaggatagg 1680 
ggtatcttgc agttttggaa aaagatgtga taggtggcaa gcacctctgg cacggcaaat 1740 
acggggtaga agttgaggcg cgggttgggc tcgcatgtgc cgttttcttg gcgtttgggg 1800 
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aatgcaattg ttgttgttaa cttgtttatt 
agcatcacaa atttcacaaa taaagcattt 
aaactcatca atgtatctta tcatgtctgg 
tgtggaaagt ccccaggctc ccca.gca.g 9 c 
tcagcaacca ggtgtggaaa gtccccaggc 
catctcaatt agtcagcaac catagtcccg 
ccgcccagtt ccgcccattc tccgccccat 
gccgaggccg cctcggcctc tgagctattc 
ctaggctttt gcaaaaagct tcacgctgcc 
ggaagcggaa cacgtagaaa gccagtccgc 
gctactgggc tatctggaca agggaaaacg 
gtgggcttac atggcgatag ctagactggg 
tgccagctgg ggcgccctct ggtaaggttg 
tcttgccgcc aaggatctga tggcgcaggg 
gatcgtttcg catgattgaa caagatggat 
agaggctatt cggctatgac tgggcacaac 
tccggctgtc agcgcagggg cgcccggttc 
tgaatgaact gcaggacgag gcagcgcggc 
gcgcagctgt gctcgacgtt gtcactgaag 
tgccggggca ggatctcctg tcatctcacc 
ctgatgcaat gcggcggctg catacgcttg 
cgaaacatcg catcgagcga gcacgtactc 
atctggacga agagcatcag gggctcgcgc 
gcatgcccga cggcgaggat ctcgtcgtga 
tggtggaaaa tggccgcttt tctggattca 
gctatcagga catagcgttg gctacccgtg 
ctgaccgctt cctcgtgctt tacggtatcg 
atcgccttct tgacgagttc ttctgagcgg 
gacgcccaac ctgccatcac gagatttcga 
cttcggaatc gttttccggg acgccggctg 
ggagttcttc gcccaccccg ggctcgatcc 
gctggacgac ctcgcggagt tctaccggca 
cagcggctat ccgcgcatcc atgcccccga 
tttggtcccg gatctttgtg aaggaacctt 
tacctacaga gatttaaagc tctaaggtaa 
aactactgat tctaattgtt tgtgtatttt 
agcagtggtg gaatgccttt aatgaggaaa 
gtgatgatga ggctactgct gactctcaac 
tagaagaccc caaggacttt ccttcagaat 
gtaatagaac tcttgcttgc tttgctattt 
acaagaaaat tatggaaaaa tattctgtaa 
ataacatact gttttttctt actccacaca 
ctcaaaaatt gtgtaccttt agctttttaa 
tgtatagtgc cttgactaga gatcataatc 
gctttaaaaa acctcccaca cctccccctg 
gttgttaact tgtttattgc agcttataat 
ttcacaaata aagcattttt ttcactgcat 
gtatcttatc atgtctggat ccccaggaag 
ctctacttga gaggacattc caatcatagg 
aattaggtca cttaacaaaa aggaaattgg 
ggtaatttta aaatatctgg gaagtccctt 
cagcccacaa atgtcaacag cagaaacata 
accctgctca tcaagaagca ctgtggttgc 
attttcccca cctgtgtagg ttccaaaata 
aacccagcac tccactggat aagcattatc 
catctgctga ctgtcaactg tagcattttt 
cctgtagttt gctaacacac cctgcagctc 
aaaatttgac ccttgaatgg gttttccagc 
atgcaagttt aacatagcag ttaccccaat 
catcaaaata tttccacagg ttaagtcctc 
cgaaagggcc tcgtgatacg cctattttta 



gcagcttata atggttacaa ataaagcaat 5520 
ttttcactgc attctagttg tggtttgtcc 5580 
atccggctgt ggaatgtgtg tcagttaggg 5640 
agaagtatgc aaagcatgca tctcaattag 5700 
tccccagcag gcagaagtat gcaaagcatg 5760 
cccctaactc cgcccatccc gcccctaact 5820 
ggctgactaa ttttttttat ttatgcagag 5880 
cagaagtagt gaggaggctt ttttggaggc 5940 
gcaagcactc agggcgcaag ggctgctaaa 6000 
agaaacggtg ctgaccccgg atgaatgtca 6060 
caagcgcaaa' gagaaagcag gtagcttgca 6120 
cggttttatg gacagcaagc gaaccggaat 6180 
ggaagccctg caaagtaaac tggatggctt 6240 
gatcaagatc tgatcaagag acaggatgag 6300 
tgcacgcagg ttctccggcc gcttgggtgg 6360 
agacaatcgg ctgctctgat gccgccgtgt 6420 
tttttgtcaa gaccgacctg tccggtgccc 6480 
tatcgtggct ggccacgacg ggcgttcctt 6540 
c 999 aa 999 a ctggctgcta ttgggcgaag 6600 
ttgctcctgc cgagaaagta tccatcatgg 6660 
atccggctac ctgcccattc gaccaccaag 6720 
ggatggaagc cggtcttgtc gatcaggatg 6780 
cagccgaact gttcgccagg ctcaaggcgc 6840 
cccatggcga tgcctgcttg ccgaatatca 6900 
tcgactgtgg ccggctgggt gtggcggacc 6960 
atattgctga agagcttggc ggcgaatggg 7020 
ccgctcccga ttcgcagcgc atcgccttct 7080 
gactctgggg ttcgaaatga ccgaccaagc 7140 
ttccaccgcc gccttctatg aaaggttggg 7200 
gatgatcctc cagcgcgggg atctcatgct 7260 
cctcgcgagt tggttcagct gctgcctgag 7320 
gtgcaaatcc gtcggcatcc aggaaaccag 7380 
actgcaggag tggggaggca cgatggccgc 7440 
acttctgtgg tgtgacataa ttggacaaac 7500 
atataaaatt tttaagtgta taatgtgtta 7560 
agattccaac ctatggaact gatgaatggg 7620 
acctgttttg ctcagaagaa atgccatcta 7680 
attctactcc tccaaaaaag aagagaaagg 7740 
tgctaagttt tttgagtcat gctgtgttta 7800 
acaccacaaa ggaaaaagct gcactgctat 7860 
cctttataag taggcataac agttataatc 7920 
ggcatagagt gtctgctatt aataactatg 7980 
tttgtaaagg ggttaataag gaatatttga 8040 
agccatacca catttgtaga ggttttactt 8100 
aacctgaaac ataaaatgaa tgcaattgtt 8160 
ggttacaaat aaagcaatag catcacaaat 8220 
tctagttgtg gtttgtccaa actcatcaat 8280 
ctcctctgtg tcctcataaa ccctaacctc 8340 
ctgcccatcc accctctgtg tcctcctgtt 8400 
gtaggggttt ttcacagacc gctttctaag 8460 
ccactgctgt gttccagaag tgttggtaaa 8520 
caagctgtca gctttgcaca agggcccaac 8580 
tgtgttagta atgtgcaaaa caggaggcac 8640 
tctagtgttt tcatttttac ttggatcagg 8700 
cttatccaaa acagccttgt ggtcagtgtt 8760 
tggggttaca gtttgagcag gatatttggt 8820 
caaaggttcc ccaccaacag caaaaaaatg 8880 
accattttca tgagtttttt gtgtccctga 8940 
aacctcagtt ttaacagtaa cagcttccca 9000 
atttaaatta ggcaaaggaa ttcttgaaga 9060 
taggttaatg tcatgataat aatggtttct 9120 
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atctgagggg actagggtgt gtttaggcga aaagcggggc ttcggttgta cgcggttagg 600 
agtcccctca ggatatagta gtttcgcttt tgcataggga gggggaaatg tagtcttatg 660 
caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc ttacaaggag 720 
agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt gccttattag 780 
gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattccgc attgcagaga 840 
tattgtattt aagtgcctag ctcgatacaa taaacgccat ttgaccattc accacattgg 900 
tgtgcacctc cggccctggc cactctcttc cgcatcgctg tctgcggggg ccagctgttg 960 
ggctcgcggt tgaggacaaa ctcttcgcgg tctttccagt actcttggat cggaaacccg 1020 
tcggcctccg aacggtactc cgccgccgag ggacctgagc gagtccgcat cgaccggatc 1080 
ggaaaacctc tcgagaaagg cgtgtaacca gtcacagtcg ctctagaact agtggatccc 1140 
ccqggctgca ggaattcgat ctagatggat aaaggtccaa aaaagaagag aaaggtagaa 1200 
gaccccaagg actttccttc agaattgcta agttttttga gtgattcact ggccgtcgtt 1260 
ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat 1320 
ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag 1380 
ttgcgcagcc tgaatggcga atggcgcttt gcctggtttc cggcaccaga agcggtgccg 1440 
gaaagctggc tggagtgcga tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg 1500 
cagatgcacg gttacgatgc gcccatctac accaacgtaa cctatcccat tacggtcaat 1560- 
ccgccgtttg ttcccacgga gaatccgacg ggttgttact cgctcacatt taatgttgat 1620 
gaaagctggc tacaggaagg ccagacgcga attatttttg atggcgttaa ctcggcgttt 1680 
catctgtggt gcaacgggcg ctgggtcggt tacggccagg acagtcgttt gccgtctgaa 1740 
tttgacctga gcgcattttt acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt 1800 
tggagtgacg gcagttatct ggaagatcag gatatgtggc ggatgagcgg cattttccgt 1860 
gacgtctcgt tgctgcataa accgactaca caaatcagcg atttccatgt tgccactcgc 1920 
tttaatgatg atttcagccg cgctgtactg gaggctgaag ttcagatgtg cggcgagttg 1980 . 
cgtgactacc tacgggtaac agtttcttta tggcagggtg aaacgcaggt cgccagcggc 2040 
accgcgcctt tcggcggtga aattatcgat- gagcgtggtg gttatgccga tcgcgtcaca 2100 
ctacgtctga acgtcgaaaa cccgaaactg tggagcgccg aaatcccgaa tctctatcgt 2160 
qcggtggttg aactgcacac cgccgacggc acgctgattg aagcagaagc ctgcgatgtc 2220 
ggtttccgcg aggtgcggat tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg 2280 
attcgaggcg ttaaccgtca cgagcatcat cctctgcatg gtcaggtcat ggatgagcag 2340 
acgatggtgc aggatatcct gctgatgaag cagaacaact ttaacgccgt gcgctgttcg 2400 
cattatccga accatccgct gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg 2460 
gatgaagcca atattgaaac ccacggcatg gtgccaatga atcgtctgac cgatgatccg 2520 
cgctggctac cggcgatgag cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac 2580 
ccgagtgtga tcatctggtc gctggggaat gaatcaggcc acggcgctaa tcacgacgcg 2640 
ctgtatcgct ggatcaaatc tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga 2700 
gccgacacca cggccaccga tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag 2760 
cccttcccgg ctgtgccgaa atggtccatc aaaaaatggc tttcgctacc tggagagacg 2820 
cgcccgctga tcctttgcga atacgcccac gcgatgggta acagtcttgg cggtttcgct 2880 
aaatactggc aggcgtttcg tcagtatccc cgtttacagg gcggcttcgt ctgggactgg 2940 
gtggatcagt cgctgattaa atatgatgaa aacggcaacc cgtggtcggc ttacggcggt 3000 
gattttggcg atacgccgaa cgatcgccag ttctgtatga acggtctggt ctttgccgac 3060 
cgcacgccgc atccagcgct gacggaagca aaacaccagc agcagttttt ccagttccgt 3120 
ttatccgggc aaaccatcga agtgaccagc gaatacctgt tccgtcatag cgataacgag 3180 
ctcctgcact ggatggtggc gctggatggt aagccgctgg caagcggtga agtgcctctg 3240 
gatgtcgctc cacaaggtaa acagttgatt gaactgcctg aactaccgca gccggagagc 3300 
gccgggcaac tctggctcac agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa 3360 
gccgggcaca tcagcgcctg gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc 3420 
cccgccgcgt cccacgccat cccgcatctg accaccagcg aaatggattt ttgcatcgag 3480 
ctgggtaata agcgttggca atttaaccgc cagtcaggct ttctttcaca gatgtggatt 3540 
ggcgataaaa aacaactgct gacgccgctg cgcgatcagt tcacccgtgc accgctggat 3600 
aacgacattg gcgtaagtga agcgacccgc attgacccta acgcctgggt cgaacgctgg 3660 
aaggcggcgg gccattacca ggccgaagca gcgttgttgc agtgcacggc agatacactt 3720 
gctgatgcgg tgctgattac gaccgctcac gcgtggcagc atcaggggaa aaccttattt 3780 
atcagccgga aaacctaccg gattgatggt agtggtcaaa tggcgattac cgttgatgtt 3840 
gaagtggcga gcgatacacc gcatccggcg cggattggcc tgaactgcca gctggcgcag 3900 
gtagcagagc gggtaaactg gctcggatta gggccgcaag aaaactatcc cgaccgcctt 3960 
actgccgcct gttttgaccg ctgggatctg ccattgtcag acatgtatac cccgtacgtc 4020 
ttcccgagcg aaaacggtct gcgctgcggg acgcgcgaat tgaattatgg cccacaccag 4080 
tggcqcqqcg acttccagtt caacatcagc cgctacagtc aacagcaact gatggaaacc 4140 
agccatcgcc atctgctgca cgcggaagaa ggcacatggc tgaatatcga cggtttccat 4200 
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agcgcggagg 

agatatgtag 

tgcgagggag 

actatctgcc 

aagctggcgt 

ttgttgacca 

atgatgtcat 

tcgcggtctt 

agcatgtaga 

tatgcctgcg 

actttgaggt 

aagtccgtgc 

atctttcccg 

cggttgttaa 

acaatgtaaa 

tcgtaggtga 

tgagggttgg 

tcgcgaaagg 

gtaagcgggt 

gtcactagag 

ccaaaggccc 

cgaggatgcg 

ttgatgtggt 

aaacgtgcgc 

cgaccgcgca 

tggtcttcta 

gatcggacca 

ttgatgacaa 

tcaggcggga 

aggtgatacc 

catccccgcg 

gatgatgcat 

ccgccgggag 

cgcgtaggtt 

gcgtgaagac 

cggtgtcgtt 

aggcgatctc 

gctccacggt 

ggcctccctc 

tgaccacctg 

gctgaaagag 

agcgtcgcaa 

agaagtccac 

ccagaagacg 

cctcttcttc 

gcggtggggg 

gctcgatcat 

gggggcgcag 

catgcggcag 

cgccgaggga 

ctaaccagtc 

cggggttgtt 

ggcggatggt 

cggccatgcc 

gcctttctac 

ctgcggcggc 

cgaagcccct 

cctgctgcac 

cgcccgtgtt 

ccggctgcga 

cgttgcaagt 



cgtacatgcc 

ggtagcatct 

cgaggaggtc 

tgaagatggc 

ctgtgagacc 

gctcggcggt 

acttatcctg 

tccagtactc 

actggttgac 

cggccttccg 

actggtattt 

gctttttgga 

cgcgaggcat 

ttacctgggc 

gttccaagaa 

gctcttcagg 

aagcgacgaa 

tcctaaactg 

cttgttccca 

gctcatctcc 

ccatccaagt 

agccgatcgg 

gaaagtagaa 

agtactggca 

caaggaagca 

cttcggctgc 

ccacgccgcg 

catcgcgcag 

gctcctgcag 

taatttccag 

gcgcgactac 

ctaaaagcgg 

agggggcagg 

gctggcgaac 

gacgggcccg 

gacggcggcc 

ggccatgaac 

ggcggcgagg 

gttccagacg 

cgcgagattg 

gtagttgagg 

cgtggattcg 

ggcgaagttg 

gatgagctcg 

ttcttcaatc 

aggggggaca 

ctccccgcgg 

ttggaagacg 

ggatacggcg 

cctgagcgag 

acagtcgcaa 

tctggcggag 

cgacagaagc 

ccaggcttcg 

cggcacttct 

ggcggagttt 

catcggctga 

ctgcgtgagg 

gatggtgtaa 

gagctcggtg 

ccgcaccagg 



gcaaatgtcg 

tccaccgcgg 

gggaccgagg 

atgtgagttg 

taccgcgtca 

gacctgcacg 

tccctttttt 

ttggatcgga 

ggcctggtag 

gagcgaggtg 

gaagtcagtg 

acgcggattt 

aaagttgcgt 

ggcgagcacg 

gcgcgggatg 

ggagctgagc 

tgagctccac 

gcgacctatg 

gcggtcccat 

gccgaacttc 

ataggtctct 

gaagaactgg 

gtccctgcga 

gcggtg cac g 

gagtgggaat 

ttgtccttga 

cgagcccaaa 

atgggagctg 

gtttacctcg 

gggctggttg 

ggtaccgcgc 

tgacgcgggc 

ggcacgtcgg 

gcgacgacgc 

gtgagcttga 

tggcgcaaaa 

tgctcgatct 

tcgttggaaa 

cggctgtaga 

agctccacgt 

gtggtggcgg 

ttgatatc cc 

aaaaactggg 

gcgacagtgt 

tcctcttcca 



cggcggcgac 

cgacggcgca 

ccgcccgtca 

ctaacgatgc 

tccgcatcga 

ggtaggctga 

gtgctgctga 

accatgtcct 

ttttgacatc 

tcttctcctt 

ggccgtaggt 

agcagggcta 

gtagactgga 

gtgcagttgg 

tacctgagac 

tactggtatc 



taaacgtaga 

atgctggcgc 

ttgctacggg 

gatgatatgg 

cgcacgaagg 

tctagggcgc 

ttccacagct 

aacccgtcgg 

gcgcagcatc 

tgggtgagcg 

tcgtcgcatc 

ggcagggcga 

gtgatgcgga 

atctcgtcaa 

cccttgatgg 

ccgtgctctg 

aggtcacggg 

gccatttttt 

ccaaggttcg 

atgaccagca 

acatcgtagg 

atctcccgcc 

cgggccgaac 

ggctgtacat 

ttgagcccct 

ccgtctggct 

gtccagatgt 

tccatggtct 

catagacggg 

gtggcggcgt 

ggcgggcggt 

gagcccccgg 

cgccgcgcgc 

ggcggttgat 

gcctgaaaga 

tctcctgcac 

cttcctcctg 

tgcgggccat 

ccacgccccc 

gccgggcgaa 

tgtgttctgc 

ccaaggcctc 

agttgcgcgc 

cgcgcacctc 

taagggcctc 

gacggcgcac 

tggtctcggt 

tgtcccggtt 

atctcaacaa 

ccggatcgga 

gcaccgtggc 

tgatgtaatt 

tgggtccggc 

ggcgcaggtc 

cctcttgtcc 

ggcgccctct 

ggtcggcgac 

agtcatccat 

ccataacgga 

gcgagtaagc 

ccaccaaaaa 



gggg c tctct 

gcacgtaatc 

cgggctgctc 

ttggacgctg 

aggcgtagga 

agtagtccag 

cgcggttgag 

cctccgaacg 

ccttttctac 

caaaggtgtc 

cgccctgctc 

aggtgacatc 

agggtcccgg 

agccgttgat 

aaggcaattt. 

aaagggccca 

ccattagcat 

ctggggtgat 

cggctaggtc 

tgaagggcac 

tgacaaagag 

accaattgga 

actcgtgctg 

cctgcacgag 

cgcctggcgg 

gctcgagggg 

ccgcgcgcgg 

ggagctcccg 

tcagggcgcg 

cgatggcttg 

gggccgcggg 

aggtaggggg 

gggcaggagc 

ctcctgaatc 

gagttcgaca 

gtctcctgag 

gagatctccg 

gagctgcgag 

ttcggcatcg 

gacggcgtag 

cacgaagaag 

aaggcgctcc 

cgacacggtt 

gcgctcaaag 

cccttcttct 

cgggaggcgg 

gacggcgcgg 

atgggttggc 

ttgttgtgta 

aaacctctcg 

gggcggcagc 

aaagtaggcg 

ctgctgaatg 

tttgtagtag 

tgcatctctt 

tcctcccatg 

aacgcgctcg 

gtccacaaag 

ccagttaacg 

cctcgagtca 

gtgcggcggc 



gagtattcca 

gtatagttcg 

tgctcggaag 

gaagacgttg 

gtcgcgcagc 

ggtttccttg 

gacaaactct 

gtaagagcct 

gggtagcgcg 

cctgaccatg 

ccagagcaaa 

gttgaagagt 

cacctcggaa 

gttgtggccc 

tttaagttcc 

gtctgcaaga 

ttgcaggtgg 

gcagtagaag 

tcgcgcggca 

gagctgcttc 

acgctcggtg 

ggagtggcta 

gcttttgtaa 

gttgacctga 

gtttggctgg 

agttacggtg 

cggtcggagc 

cggcgtcagg 

ggctagatcc 

caagaggccg 

ggtgtccttg 

ggctccggac 

tggtgctgcg 

tggcgcctct 

gaatcaattt 

ttgtcttgat 

cgtccggctc 

aaggcgttga 

cgggcgcgca 

tttcgcaggc 

tacataaccc 

atggcctcgt 

aactcctcct 

gctacagggg 

tcttctggcg 

tcgacaaagc 

ccgttctcgc 

ggggggctgc 

ggtactccgc 

agaaaggcgt 

gggcggcggt 

gtcttgagac 

cgcaggcggt 

tcttgcatga 

gcatctatcg 

cgtgtgaccc 

gctaatatgg 

cggtggtatg 

gtctggtgac 

aatacgtagt 

ggctggcggt 



7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 
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8640 

8700 

8760 
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8880 
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9000 
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9120 
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9300 
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9600 
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9900 

9960 
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10200 

10260 
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10380 

10440 

10500 
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10740 

10800 
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11280 

11340 
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cccgtttgcg caccttcgcc ccaggctggg gagaatgttt taaaaaaaaa aaagcatgat 15240 

gcaaaataaa aaact caeca aggccatggc accgagcgtt ggttttcttg tattcccctt 15300 

agtatgegge gegeggegat gtatgaggaa ggtcctcctc cctcctacga gagtgtggtg 15360 

agcgcggcgc cagtggcggc ggcgctgggt tctcccttcg atgctcccct ggacccgccg 15420 

tttgtgcctc cgcggtacct gcggcctacc ggggggagaa acagcatccg ttactctgag 15480 

ttggcacccc tattcgacac cacccgtgtg tacctggtgg acaacaagtc aacggatgtg 15540 

gcatccctga actaccagaa cgaccacagc aactttctga ccacggtcat tcaaaacaat 15600 

gactacagcc cgggggaggc aagcacacag accatcaatc ttgacgaccg gtcgcactgg 15660 

ggcggcgacc tgaaaaccat cctgcatacc aacatgccaa atgtgaacga gttcatgttt 15720 

accaataagt ttaaggcgcg ggtgatggtg tcgcgcttgc ctactaagga caatcaggtg 15780 

gagctgaaat acgagtgggt ggagttcacg ctgcccgagg gcaactactc cgagaccatg 15840 

accatagacc ttatgaacaa egegategtg gagcactact tgaaagtggg cagacagaac 15900 

gggg ttct 99 aaagcgacat cggggtaaag tttgacaccc gcaacttcag actggggttt 15960 

gaccccgtca ctggtcttgt catgcctggg gtatatacaa aegaageett ccatccagac 16020 

atcattttgc tgccaggatg cggggtggac ttcacccaca gccgcctgag caacttgttg 1608a * 

ggcatccgca agcggcaacc cttccaggag ggctttagga tcacctacga tgatctggag 16140 

ggtggtaaca ttcccgcact gttggatgtg gacgcctacc aggegagett gaaagatgac 16200 

accgaacagg gcgggggtgg cgcaggcggc agcaacagca gtggcagcgg egeggaagag 16260 > 

aactccaacg cggcagccgc ggcaatgcag ccggtggagg acatgaacga tcatgccatt 16320 

cgcggcgaca cctttgccac acgggctgag gagaagegeg ctgaggccga agcagcggcc 1638a 

gaagctgccg cccccgctgc gcaacccgag gtegagaage ctcagaagaa accggtgatc 16440 

aaacccctga cagaggacag caagaaacgc agttacaacc taataagcaa tgacagcacc 16500 

ttcacccagt accgcagctg gtaccttgca tacaactacg gcgaccctca gaccggaatc 16560 

cgctcatgga ccctgctttg cactcctgac gtaacctgcg geteggagea ggtctactgg 16620 

tcgttgccag acatgatgca agaccccgtg accttccgct ccacgcgcca gatcagcaac 16680 

tttccggtgg tgggcgccga gctgttgccc gtgcactcca agagetteta caacgaccag 16740 

gccgtctact cccaactcat ccgccagttt acctctctga cccacgtgtt caatcgcttt 16800 

cccgagaacc agattttggc gcgcccgcca gcccccacca tcaccaccgt cagtgaaaac 16860 

gttcctgctc tcacagatca egggaegeta ccgctgcgca acagcatcgg aggagtccag 16920 
cgagtgacca ttactgacgc cagacgccgc acctgcccct acgtttacaa ggccctgggc 16980 
atagtetege cgcgcgtcct atcgagccgc actttttgag caagcatgtc catccttata 17040 
tcgcccagca ataacacagg ctggggcctg cgcttcccaa gcaagatgtt tggcggggcc 17100 
aagaageget ccgaccaaca cccagtgcgc gtgcgcgggc actaccgcgc gccctggggc 17160 
gcgcacaaac gcggccgcac tgggcgcacc accgtcgatg acgccatcga cgcggtggtg 17220 
gaggaggege gcaactacac gcccacgccg ccaccagtgt ccacagtgga cgcggccatt 17280 
cagaccgtgg tgegeggage ccggcgctat gctaaaatga agagaeggeg gaggegegta 17340 
gcacgtcgcc accgccgccg acccggcact gccgcccaac gcgcggcggc ggccctgctt 17400 
aaccgcgcac gtcgcaccgg ccgacgggcg gccatgcggg ccgctcgaag gctggccgcg 17460 
ggtattgtca ctgtgccccc caggtccagg egaegagegg ccgccgcagc agccgcggcc 17520 
attagtgeta tgactcaggg tcgcaggggc aacgtgtatt gggtgcgcga ctcggttagc 17580 
ggcctgcgcg tgcccgtgcg cacccgcccc ccgcgcaact agattgcaag aaaaaactac 17640 
ttagactcgt actgttgtat gtatccagcg gcggcggcgc gcaacgaagc tatgtccaag 17700 
cgcaaaatca aagaagagat gctccaggtc atcgcgccgg agatetatgg ccccccgaag 17760 
aaggaagage aggattacaa gccccgaaag ctaaagcggg tcaaaaagaa aaagaaagat 17820 
gatgatgatg aaettgaega cgaggtggaa ctgctgcacg ctaccgcgcc caggcgacgg 17880 
gtacagtgga aaggtcgacg cgtaaaacgt gttttgcgac ccggcaccac cgtagtcttt 17940 
acgcccggtg agcgctccac ccgcacctac aagcgcgtgt atgatgaggt gtaeggegae 18000 
gaggacctgc ttgageagge caacgagcgc ctcggggagt ttgcctacgg aaageggeat 18060 
aaggacatgc tggcgttgcc gctggacgag ggcaacccaa cacctagcct aaagcccgta 18120 
acactgcagc aggtgctgcc cgcgcttgca ccgtccgaag aaaagcgcgg cctaaagcgc 18180 
gagtctggtg acttggcacc caccgtgcag ctgatggtac ccaagcgcca gcgactggaa 18240 
gatgtcttgg aaaaaatgac cgtggaacct gggctggagc ccgaggtccg cgtgcggcca 18300 
atcaagcagg tggcgccggg actgggcgtg cagaccgtgg acgttcagat acccactacc 18360 
agtagcacca gtattgccac cgccacagag ggcatggaga cacaaacgtc cccggttgcc 18420 
tcagcggtgg cggatgccgc ggtgcaggcg gtcgctgcgg ccgcgtccaa gacctctacg 18480 
gaggtgcaaa cggacccgtg gatgtttcgc gtttcagccc cccggcgccc gcgcggttcg 18540 
aggaagtacg gcgccgccag cgcgctactg cccgaatatg ccctacatcc ttccattgcg 18600 
cctacccccg gctatcgtgg ctacacctac cgccccagaa gacgagcaac tacccgacgc 18660 
cgaaccacca ctggaacccg ccgccgccgt cgccgtcgcc agcccgtgct ggccccgatt 18720 
tccgtgcgca gggtggctcg egaaggagge aggaccctgg tgctgccaac agcgcgctac 18780 
caccccagca tcgtttaaaa gccggtcttt gtggttcttg cagatatggc cctcacctgc 18840 
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gaaggacagg cctaccctgc taacttcccc tatccgctta taggcaagac cgcagttgac 22560 

agcattaccc agaaaaagtt tctttgcgat cgcacccttt ggcgcatccc attctccagt 22620 

aactttatgt ccatgggcgc actcacagac ctgggccaaa accttctcta cgccaactcc 22680 

gcccacgcgc tagacatgac ttttgaggtg gatcccatgg acgagcccac ccttctttat 22740 

gttttgtttg aagtctttga cgtggtccgt gtgcaccggc cgcaccgcgg cgtcatcgaa 22800 

accgtgtacc tgcgcacgcc cttctcggcc ggcaacgcca caacataaag aagcaagcaa 22860 

catcaacaac agctgccgcc atgggctcca gtgagcagga actgaaagcc attgtcaaag 22920 

atcttggttg tgggccatat tttttgggca cctatgacaa gcgctttcca ggctttgttt 22980 

ctccacacaa gctcgcctgc gccatagtca atacggccgg tcgcgagact gggggcgtac 23040 

actggatggc ctttgcctgg aacccgcact caaaaacatg ctacctcttt gagccctttg 23100 

gcttttctga ccagcgactc aagcaggttt accagtttga gtacgagtca ctcctgcgcc 23160 

gtagcgccat tgcttcttcc cccgaccgct gtataacgct ggaaaagtcc acccaaagcg 23220 

tacaggggcc caactcggcc gcctgtggac tattctgctg catgtttctc cacgcctttg 23280 

ccaactggcc ccaaactccc atggatcaca accccaccat gaaccttatt accggggtac 23340 

ccaactccat gctcaacagt ccccaggtac agcccaccct gcgtcgcaac caggaacagc 23400 

tctacagctt cctggagcgc cactcgccct acttccgcag ccacagtgcg cagattagga 23460 

gcgccacttc tttttgtcac ttgaaaaaca tgtaaaaata atgtactaga gacactttca 23520 

ataaaggcaa atgcttttat ttgtacactc tcgggtgatt atttaccccc acccttgccg 23580 

tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc actggcaggg 23640 

acacgttgcg atactggtgt ttagtgct cc acttaaactc aggcacaacc atccgcggca 23700 

gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt agcaggtcgg 23760 

gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag ttgcgataca 23820 

cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc agcacgctct 23880 

tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac ggagtcaact 23940 

ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac tcgcaccgta 24000 

gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc tgcataaaag 24060 

ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac atgccgcaag 24120 

acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac cttgcgtcgg 24180 

tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg gccttgctag 24240 

actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca atcacgtgct 24300 

ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc tcagcgcagc 24360 

ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc tctgcaaacg 24420 

actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg ttgctggtga 24480 

aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg gccgccagag 24540 

cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc acgtggtact 24600 

tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg atcggcacac 24660 

tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc tcttcctctt 24720 

gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact gtgcgcttac 24780 

ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt tgtagcgcca 24840 

catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg cgctcgggct 24900 

tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc gccgaggtcg 24960 

atggccg c gg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct tcctcgtcct 25020 

cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc ggcggcgacg 25080 

gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg cgtccgcgct 25140 

cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc tataggcaga 25200 

aaaagatcat ggagtcagtc gagaagaagg acagcctaac cgccccctct gagttcgcca 25260 

ccaccgcctc caccgatgcc gccaacgcgc ctaccacctt ccccgtcgag gcacccccgc 25320 

ttgaggagga ggaagtgatt atcgagcagg acccaggttt tgtaagcgaa gacgacgagg 25380 

accgctcagt accaacagag gataaaaagc aagaccagga caacgcagag gcaaacgagg 25440 

aacaagtcgg gcggggggac gaaaggcatg gcgactacct agatgtggga gacgacgtgc 25500 

tgttgaagca tctgcagcgc cagtgcgcca ttatctgcga cgcgttgcaa gagcgcagcg 25560 

atgtgcccct cgccatagcg gatgtcagcc ttgcctacga acgccaccta ttctcaccgc 25620 

gcgtaccccc caaacgccaa gaaaacggca catgcgagcc caacccgcgc ctcaacttct 25680 

accccgtatt tgccgtgcca gaggtgcttg ccacctatca catctttttc caaaactgca 25740 

agatacccct atcctgccgt gccaaccgca gccgagcgga caagcagctg gccttgcggc 25800 

agggcgctgt catacctgat atcgcctcgc tcaacgaagt gccaaaaatc tttgagggtc 25860 

ttggacgcga cgagaagcgc gcggcaaacg ctctgcaaca ggaaaacagc gaaaatgaaa 25920 

gtcactctgg agtgttggtg gaactcgagg gtgacaacgc gcgcctagcc gtactaaaac 25980 

gcagcatcga ggtcacccac tttgcctacc cggcacttaa cctacccccc aaggtcatga 26040 

gcacagtcat gagtgagctg atcgtgcgcc gtgcgcagcc cctggagagg gatgcaaatt 26100 

tgcaagaaca aacagaggag ggcctacccg cagttggcga cgagcagcta gcgcgctggc 2616 0 
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agagctccaa gacatggtta acttgcacca gtgcaaaagg ggtatctttt gtctggtaaa 29880 

gcaggccaaa gtcacctacg acagtaatac caccggacac cgccttagct acaagttgcc 29940 

aaccaagcgt cagaaattgg tggtcatggt gggagaaaag cccattacca taactcagca 30000 

ctcggtagaa accgaaggct gcattcactc accttgtcaa ggacctgagg atctctgcac 30060 

ccttattaag accctgtgcg gtctcaaaga tcttattccc tttaactaat aaaaaaaaat 30120 

aataaagcat cacttactta aaatcagtta gcaaatttct gtccagttta ttcagcagca 30180 

cctccttgcc ctcctcccag ctctggtatt gcagcttcct cctggctgca aactttctcc 30240 

acaatctaaa tggaatgtca gtttcctcct gttcctgtcc atccgcaccc actatcttca 30300 

tgttgttgca gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 30360 

atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 30420 

atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 30480 

cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 30540 

accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 30600 

taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 30660 

ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 30720 

tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 30780 

tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 30840 

cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 30900 

atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 30960 

taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 31020 

ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 31080 

gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 31140 

atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 31200 

acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 31260 

aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag 31320 

ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 31380 

tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 31440 

taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 31500 

ataagctaac tttgtggacc acaccagctc catctcctaa ctgtagacta aatgcagaga 31560 

j aagatgctaa actcactttg gtcttaacaa aatgtggcag tcaaatactt gctacagttt 31620 

cagttttggc tgttaaaggc agtttggctc caatatctgg aacagttcaa agtgctcatc 31680 

ttattataag atttgacgaa aatggagtgc tactaaacaa ttccttcctg gacccagaat 31740 

attggaactt tagaaatgga gatcttactg aaggcacagc ctatacaaac gctgttggat 31800 

ttatgcctaa cctatcagct tatccaaaat ctcacggtaa aactgccaaa agtaacattg 31860 

tcagtcaagt ttacttaaac ggagacaaaa ctaaacctgt aacactaacc attacactaa 31920 

acggtacaca ggaaacagga gacacaactc caagtgcata ctctatgtca ttttcatggg 31980 

actggtctgg ccacaactac attaatgaaa tatttgccac atcctcttac actttttcat 32040 

acattgccca agaataaaga atcgtttgtg ttatgtttca acgtgtttat ttttcaattg 32100 

cagaaaattt caagtcattt ttcattcagt agtatagccc caccaccaca tagcttatac 32160 

agatcaccgt accttaatca aactcacaga accctagtat tcaacctgcc acctccctcc 32220 

caacacacag agtacacagt cctttctccc cggctggcct taaaaagcat catatcatgg 32280 

gtaacagaca tattcttagg tgttatattc cacacggttt cctgtcgagc caaacgctca 32340 

tcagtgatat taataaactc cccgggcagc tcacttaagt tcatgtcgct gtccagctgc 32400 

tgagccacag gctgctgtcc aacttgcggt tgcttaacgg gcggcgaagg agaagtccac 32460 

gcctacatgg gggtagagtc ataatcgtgc atcaggatag ggcggtggtg ctgcagcagc 32520 

gcgcgaataa actgctgccg ccgccgctcc gtcctgcagg aatacaacat ggcagtggtc 32580 

tcctcagcga tgattcgcac cgcccgcagc ataaggcgcc ttgtcctccg ggcacagcag 32640 

cgcaccctga tctcacttaa atcagcacag taactgcagc acagcaccac aatattgttc 32700 

aaaatcccac agtgcaaggc gctgtatcca aagctcatgg cggggaccac agaacccacg 32760 

tggccatcat accacaagcg caggtagatt aagtggcgac ccctcataaa cacgctggac 32820 

ataaacatta cctcttttgg catgttgtaa ttcaccacct cccggtacca tataaacctc 32880 

tgattaaaca tggcgccatc caccaccatc ctaaaccagc tggccaaaac ctgcccgccg 32940 

gctatacact gcagggaacc gggactggaa caatgacagt ggagagccca ggactcgtaa 33000 

ccatggatca tcatgctcgt catgatatca atgttggcac aacacaggca cacgtgcata 33060 

cacttcctca ggattacaag ctcctcccgc gttagaacca tatcccaggg aacaacccat 33120 

tcctgaatca gcgtaaatcc cacactgcag ggaagacctc gcacgtaact cacgttgtgc 33180 

attgtcaaag tgttacattc gggcagcagc ggatgatcct ccagtatggt agcgcgggtt 33240 

tctgtctcaa aaggaggtag acgatcccta ctgtacggag tgcgccgaga caaccgagat 3330 0 

egtgttggtc gtagtgtcat gccaaatgga acgccggacg tagtcatatt tcctgaagca 33360 

aaaccaggtg cgggcgtgac aaacagatct gcgtctccgg tctcgccgct tagatcgctc 33420 

tgtgtagtag ttgtagtata tccactctct caaagcatcc aggcgccccc tggcttcggg 33480 
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aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 8700 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 8760 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 8820 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 8880 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 8940 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 9000 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 9060 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 9120 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 9180. 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 9240 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 9300 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 9360 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 9420 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 9480 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 9540 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 9600 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 9660 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 9720 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 9780 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 9840 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 9900 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 9960 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 10020 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 10080 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 10140 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 10200 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 10260 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 10320 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 10380 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 10440 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 10500 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 10560 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 10620 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 10680 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 10740 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 10800 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 10860 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 10920 

aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 10980 

tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 11040 

tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 11100 

gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 11160 

aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 11220 

tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 11280 

tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 11340 

acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 11400 

tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 11460 

aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 11520 

gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 11580 

gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagac cgtgcaaaag 11640 

gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa gggtatcatg 11700 

gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat gcggttaccg 11760 

cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc ttttggcttc 11820 

cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc gcagcgtaag 11880 

cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg agggttattt 11940 

tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accggccgga ctgcggcgaa 12000 

egggggtttg cctccccgtc atgcaagacc ccgcttgcaa attcctccgg aaacagggac 12060 

gagccccttt tttgcttttc ccagatgcat ccggtgctgc ggcagatgcg cccccctcct 12120 

cagcagcggc aagagcaaga gcagcggcag acatgcaggg caccctcccc tcctcctacc 12180 

gcgtcaggag gggcgacatc cgcggttgac gcggcagcag atggtgatta cgaacccccg 12240 

cggcgccggg cccggcacta cctggacttg gaggagggcg agggcctggc gcggctagga 12300 
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cttgttgggc atccgcaagc ggcaaccctt ccaggagggc tttaggatca cctacgatga 16020 
tctggagggt ggtaacattc ccgcactgtt ggatgtggac gcctaccagg cgagcttgaa 16080 
agatgacacc gaacagggcg ggggtggcgc aggcggcagc aacagcagtg gcagcggcgc 16140 
ggaagagaac tccaacgcgg cagccgcggc aatgcagccg gtggaggaca tgaacgatca 16200 
tgccattcgc ggcgacacct ttgccacacg ggctgaggag aagcgcgctg aggccgaagc 16260 
aqcggccgaa gctgccgccc ccgctgcgca acccgaggtc gagaagcctc agaagaaacc 16320 
ggtgatcaaa cccctgacag aggacagcaa gaaacgcagt tacaacctaa taagcaatga 16380 
cagcaccttc acccagtacc gcagctggta ccttgcatac aactacggcg accctcagac 16440 
cggaatccgc tcatggaccc tgctttgcac tcctgacgta acctgcggct cggagcaggt 16500 
ctactggtcg ttgccagaca tgatgcaaga ccccgtgacc ttccgctcca cgcgccagat 16560 
cagcaacttt ccggtggtgg gcgccgagct gttgcccgtg cactccaaga gcttctacaa 16620 
cgaccaggcc gtctactccc aactcatccg ccagtttacc tctctgaccc acgtgttcaa 16680 
tcgctttccc gagaaccaga ttttggcgcg cccgccagcc cccaccatca ccaccgtcag 16740 
tgaaaacgtt cctgctctca cagatcacgg gacgctaccg ctgcgcaaca gcatcggagg 16800 
agtccagcga gtgaccatta ctgacgccag acgccgcacc tgcccctacg tttacaaggc 16860 
cctgggcata gtctcgccgc gcgtcctatc gagccgcact ttttgagcaa gcatgtccat 16920 
ccttatatcg cccagcaata acacaggctg gggcctgcgc ttcccaagca agatgtttgg 16980 
cggggccaag aagcgctccg accaacaccc agtgcgcgtg cgcgggcact accgcgcgcc 17040 
ctggggcgcg cacaaacgcg gccgcactgg gcgcaccacc gtcgatgacg ccatcgacgc 17100 
qqtggtggag gaggcgcgca actacacgcc cacgccgcca ccagtgtcca cagtggacgc 17160 
ggccattcag accgtggtgc gcggagcccg gcgctatgct aaaatgaaga gacggcggag 17220 
gcgcgtagca cgtcgccacc gccgccgacc cggcactgcc gcccaacgcg cggcggcggc 17280 
cctgcttaac cgcgcacgtc gcaccggccg acgggcggcc atgcgggccg ctcgaaggct 17340 
ggccgcgggt attgtcactg tgccccccag gtccaggcga cgagcggccg ccgcagcagc 17400 
cgcggccatt agtgctatga ctcagggtcg caggggcaac gtgtattggg tgcgcgactc 17460 
ggttagcggc ctgcgcgtgc ccgtgcgcac ccgccccccg cgcaactaga ttgcaagaaa 17520 
aaactactta gactcgtact gttgtatgta tccagcggcg gcggcgcgca acgaagctat 17580 
gtccaagcgc aaaatcaaag aagagatgct ccaggtcatc gcgccggaga tctatggccc 17640 
cccgaagaag gaagagcagg attacaagcc ccgaaagcta aagcgggtca aaaagaaaaa 1/700 
gaaagatgat gatgatgaac ttgacgacga ggtggaactg ctgcacgcta ccgcgcccag 17760 
gcgacgggta cagtggaaag gtcgacgcgt aaaacgtgtt ttgcgacccg gcaccaccgt 17820 
agtctttacg cccggtgagc gctccacccg cacctacaag cgcgtgtatg atgaggtgta 17880 
cggcgacgag gacctgcttg agcaggccaa cgagcgcctc ggggagtttg cctacggaaa 17940 
gcggcataag gacatgctgg cgttgccgct ggacgagggc aacccaacac ctagcctaaa 18000 
gcccgtaaca ctgcagcagg tgctgcccgc gcttgcaccg tccgaagaaa agcgcggcct 18060 
aaagcgcgag tctggtgact tggcacccac cgtgcagctg atggtaccca agcgccagcg 18120 
actggaagat gtcttggaaa aaatgaccgt ggaacctggg ctggagcccg aggtccgcgt 18180 
gcggccaatc aagcaggtgg cgccgggact gggcgtgcag accgtggacg ttcagatacc 18240 
cactaccagt agcaccagta ttgccaccgc cacagagggc atggagacac aaacgtcccc 18300 
ggttgcctca gcggtggcgg atgccgcggt gcaggcggtc gctgcggccg cgtccaagac 18360 
ctctacggag gtgcaaacgg acccgtggat gtttcgcgtt tcagcccccc ggcgcccgcg 18420 
cggtt c ga.gg aagtacggcg ccgccagcgc gctactgccc gaatatgccc tacatccttc 18480 
cattgcgcct acccccggct atcgtggcta cacctaccgc cccagaagac gagcaactac 18540 
ccgacgccga accaccactg gaacccgccg ccgccgtcgc cgtcgccagc ccgtgctggc 18600 

cccgatttcc gtgcgcaggg tggctcgcga aggaggcagg accctggtgc tgccaacagc 18660 

gcgctaccac cccagcatcg tttaaaagcc ggtctttgtg gttcttgcag atatggccct 18720 

cacctgccgc ctccgtttcc cggtgccggg attccgagga agaatgcacc gtaggagggg 18780 

catggccggc cacggcctga cgggcggcat gcgtcgtgcg caccaccggc ggcggcgcgc 18840 

gtcgcaccgt cgcatgcgcg gcggtatcct gcccctcctt attccactga tcgccgcggc 18900 

gattggcgcc gtgcccggaa ttgcatccgt ggccttgcag gcgcagagac actgattaaa 18960 

aacaagttgc atgtggaaaa atcaaaataa aaagtctgga ctctcacgct cgcttggtcc 19020 

tgtaactatt ttgtagaatg gaagacatca actttgcgtc tctggccccg cgacacggct 19080 

cgcgcccgtt catgggaaac tggcaagata tcggcaccag caatatgagc ggtggcgcct 19140 

tcagctgggg ctcgctgtgg agcggcatta aaaatttcgg ttccaccgtt aagaactatg 19200 

gcagcaaggc ctggaacagc agcacaggcc agatgctgag ggataagttg aaagagcaaa 19260 

atttccaaca aaaggtggta gatggcctgg cctctggcat tagcggggtg gtggacctgg 19320 

ccaaccaggc agtgcaaaat aagattaaca gtaagcttga tccccgccct cccgtagagg 19380 

agcctccacc ggccgtggag acagtgtctc cagaggggcg tggcgaaaag cgtccgcgcc 19440 

ccgacaggga agaaactctg gtgacgcaaa tagacgagcc tccctcgtac gaggaggcac 19500 

taaaqcaagg cctgcccacc acccgtccca tcgcgcccat ggctaccgga gtgctgggcc 19560 

agcacacacc cgtaacgctg gacctgcctc cccccgccga cacccagcag aaacctgtgc 19620 




WO 03/062400 



PCT/US03/02295 



- 31 - 



gaacagctct 

attaggagcg 

actttcaata 

cttgccgtct 

atggttgggg 

tcttcccgac 

aagaaggaca 

aacgcgccta 

gagcaggacc 

aaaaagcaag 

aggcatggcg 

tgcgccatta 

gtcagccttg 

aacggcacat 

gtgcttgcca 

aaccgcagcc 

gcctcgctca 

gcaaacgctc 

ctcgagggtg 

gcctacccgg 

gtgcgccgtg 

ctacccgcag 

ttggaggagc 

atgcagcggt 

acctttcgac 

ctggtctcct 

acgctcaagg 

tacacctggc 

gagctgcaga 

tccgtggccg 

cagggtctgc 

gagcgctcag 

aagtaccgcg 

taccttgcct 

cactgtcgct 

aacgaaagtc 

gcggctccgg 

gtacctgagg 

aatgcggagc 

atcaacaaag 

cagtccggcg 

cgggcccttg 

ggacgaggag 

catgatggaa 

cgaaacaccg 

ttccagcatg 

caaccgtaga 

agcccaagag 

agttgcttgc 

ccatcacggc 

atactgcacc 

gaccggatag 

gaggagcgct 

tttttcccac 

taaaaaacag 

agcttcggcg 

ttaaggacta 

ggccacaccc 

cctacatgtg 

caacccgaat 

gcgcccaccg 



acagcttcct 

ccacttcttt 

aaggcaaatg 

gcgccgtttg 

gacgtcgcgc 

tggccatttc 

gcctaaccgc 

ccaccttccc 

caggttttgt 

accaggacaa 

actacctaga 

tctgcgacgc 

cctacgaacg 

gcgagcccaa 

cctatcacat 

gagcggacaa 

acgaagtgcc 

tgcaacagga 

acaacgcgcg 

cacttaacct 

cgcagcccct 

ttggcgacga 

gacgcaaact 

tctttgctga 

agggctacgt 

accttggaat 

gcgaggcgcg 

agacggccat 

aactgctaaa 

cgcacctggc 

cagacttcac 

gaatcttgcc 

aatgccctcc 

accactctga 

gcaacctatg 

aaattatcgg 

ggttgaaact 

actaccacgc 

ttaccgcctg 

cccgccaaga 

aggagctcaa 

cttcccagga 

gaatactggg 

gactgggaga 

tcaccctcgg 

gctacaacct 

tgggacacca 

caacaacagc 

ttgcaagact 

gtggccttcc 

ggcggcagcg 

caagactctg 

gcgtctggcg 

tctgtatgct 

gtctctgcga 

cacgctggaa 

gtttcgcgcc 

ggcgccagca 

gagttaccag 

aaactacatg 

aaaccgaatt 



ggagcgccac 

ttgtcacttg 

cttttatttg 

gggaggcggc 

cgcaccgcgt 

cttctcctat 

cccctctgag 

cgtcgaggca 

aagcgaagac 

cgcagaggca 

tgtgggagac 

gttgcaagag 

ccacctattc 

cccgcgcctc 

ctttttccaa 

gcagctggcc 

aaaaatcttt 

aaacagcgaa 

cctagccgta 

accccccaag 

ggagagggat 

gcagctagcg 

aatgatggcc 

cccggagatg 

acgccaggcc 

tttgcacgaa 

ccgcgactac 

gggcgtttgg 

gcaaaacttg 

ggacatcatt 

cagtcaaagc 

cgccacctgc 

gccgctttgg 

cataatggaa 

caccccgcac 

tacctttgag 

cactccgggg 

ccacgagatt 

cgtcattacc 

gtttctgcta 

cccaatcccc 

tggcacccaa 

acagtcaggc 

gcctagacga 

tcgcattccc 

ccgctcctca 

ctggaaccag 

gccaaggcta 

gtgggggcaa 

cccgtaacat 

gcagcggcag 

acaaagccca 

cccaacgaac 

atatttcaac 

tccctcaccc 

gacgcggagg 

ctttctcaaa 

cctgtcgtca 

ccacaaatgg 

agcgcgggac 

ctcttggaac 



tcgccctact 

aaaaacatgt 

tacactctcg 

ggcgacgggg 

ccgcgctcgg 

aggcagaaaa 

ttcgccacca 

cccccgcttg 

gacgaggacc 

aacgaggaac 

gacgtgctgt 

cgcagcgatg 

tcaccgcgcg 

aacttctacc 



aactgcaaga 

ttgcggcagg 

gagggtcttg 

aatgaaagtc 

ctaaaacgca 

gtcatgagca 

gcaaatttgc 

cgctggcttc 

gcagtgctcg 

cagcgcaagc 

tgcaagatct 

aaccgccttg 

gtccgcgact 

cagcagtgct 

aaggacctat 

ttccccgaac 

atgttgcaga 

tgtgcacttc 

ggccactgct 

gacgtgagcg 

cgctccctgg 

ctgcagggtc 

ctgtggacgt 

aggttctacg 

cagggccaca 

cgaaagggac 

ccgccgccgc 

aaagaagctg 

agaggaggtt 

ggaagcttcc 

ctcgccggcg 

ggcgccgccg 

ggccggtaag 

ccgctcatgg 

catctccttc 

cctgcattac 

caacagcagc 

agaaatccac 

ccgtatcgac 

agagcagggg 

gcagctgcct 

ctctcttcag 

tttaagcgcg 

gcgccattat 

gacttgcggc 

cccacatgat 

aggcggctat 



tccgcagcca 

aaaaataatg 

ggtgattatt 

acggggacga 

gggtggtttc 

agatcatgga 

ccgcctccac 

aggaggagga 

gctcagtacc 

aagtcgggcg 

tgaagcatct 

tgcccctcgc 

taccccccaa 

ccgtatttgc 

tacccctatc 

gcgctgtcat 

gacgcgacga 

actctggagt 

gcatcgaggt 

cagtcatgag 

aagaacaaac 

aaacgcgcga 

ttaccgtgga 

tagaggaaac 

ccaacgtgga 

ggcaaaacgt 

gcgtttactt 

tggaggagtg 

ggacggcctt 

gcctgcttaa 

actttaggaa 

ctagcgactt 

accttctgca 

gtgacggtct 

tttgcaattc 

cctcgcctga 

cggcttacct 

aagaccaatc 

ttcttggcca 

gggggg ttta 

agccctatca 

cagctgccgc 

ttggacgagg 

gaggtcgaag 

ccccagaaat 

gcactgcccg 

tccaagcagc 

cgcgggcaca 

gcccgccgct 

taccgtcatc 

ggccacacag 

agcggcggca 

ccgcgagctt 

ccaagaacaa 

gtatcacaaa 

taaatactgc 

aaaactacgt 

gagcaaggaa 

tggagctgcc 

atcccgggtc 

taccaccaca 



cagtgcgcag 

tactagagac 

tacccccacc 

cacgtcctcc 

gcgctgctcc 

gtcagtcgag 

cgatgccgcc 

agtgattatc 

aacagaggat 

gggggacgaa 

gcagcgccag 

catagcggat 

acgccaagaa 

cgtgccagag 

ctgccgtgcc 

acctgatatc 

gaagcgcgcg 

gttggtggaa 

cacccacttt 

tgagctgatc 

agaggagggc 

gcctgccgac 

gcttgagtgc 

attgcactac 

gctctgcaac 

gcttcattcc 

atttctatgc 

caacctcaag 

caacgagcgc 

aaccctgcaa 

ctttatccta 

tgtgcccatt 

gctagccaac 

actggagtgt 

gcagctgctt 

cgaaaagtcc 

tcgcaaattt 

ccgcccgcca 

attgcaagcc 

cttggacccc 

gcagcagccg 

cgccacccac 

aggaggagga 

aggtgtcaga 

cggcaaccgg 

ttcgccgacc 

cgccgccgtt 

agaacgccat 

ttcttctcta 

tctacagccc 

aagcaaaggc 

gcagcaggag 

agaaacagga 

gagctgaaaa 

agcgaagatc 

gcgctgactc 

catctccagc 

attcccacgc 

caagactact 

aacggaatcc 

cctcgtaata 



23340 

23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 

24900 

24960 

25020 

25080 

25140 

25200 

25260 

25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 
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tcacagaacc 
ttctccccgg 
tatattccac 
gggcagctca 
ttgcggttgc 
atcgtgcatc 
ccgctccgtc 
ccgcagcata 
agcacagtaa 
gtatccaaag 
gtagattaag 
gttgtaattc 
caccatccta 
actggaacaa 
gatatcaatg 
ctcccgcgtt 
actgcaggga 
cagcagcgga 
atccctactg 
aaatggaacg 
cagatctgcg 
actctctcaa 
ccgctgccct 
cgttctgcga 
tattccaaaa 
ccggtggcgt 
acaatggctt 
tcagggtgaa 
tctcgccacc 
aaaatctgct 
attcaggttc 
gatcccgtag 
gcgcggccac 
tactcggagc 
ataaaatgca 
tcgtagtcat 
accatttttc 
aaaacattta 
gacggactac 
ccaccgacag 
gtt gat teat 
aggegtagag 
aacacataaa 
acaacataca 
cctattaaaa 
gccaagtgca 
aaaacaccca 
acttcctcaa 
tacaattccc 
cacgccccgc 
ataaggtata 



ctagtattca 

ctggecttaa 

acggtttcct 

ettaagttea 

ttaacgggcg 

aggataggge 

ctgcaggaat 

aggegeettg 

ctgcagcaca 

ctcatggcgg 

tggcgacccc 

accacctccc 

aaccagctgg 

tgacagtgga 

ttggcacaac 

agaaccatat 

agacctcgca 

tgatcctcca 

tacggagtgc 

ccggacgtag 

tctccggtct 

agcatccagg 

gataacatcc 

gtcacacacg 

gattatccaa 

ggtcaaactc 

ccaaaaggca 

tctcctctat 

ttctcaatat 

ccagagcgcc 

ctcacagacc 

gtcccttcgc 

ttccccgcca 

tatgctaacc 

aggtgctgct 

gctcatgcag 

tctcaaacat 

aacattagaa 

ggccatgccg 

ctcctcggtc 

cggtcagtgc 

acaacattac 

cacctgaaaa 

gcgcttcaca 

aaacaccact 

gagegagtat 

gaaaaccgca 

ategteaett 

aacacataca 

gccacgtcac 

ttattgatga 



acctgccacc 
aaagcatcat 
gtcgagccaa 
tgtcgctgtc 
gegaaggaga 
ggtggtgctg 
acaacatggc 
tcctccgggc 
gcaccacaat 
ggaccacaga 
tcataaacac 
ggtaccatat 
ccaaaacctg 
gagcccagga 
acaggcacac 
cccagggaac 
cgtaactcac 
gtatggtagc 
gccgagacaa 
tcatatttcc 
cgccgcttag 
cgccccctgg 
accaccgcag 
ggaggagegg 
aacctcaaaa 
tacagccaaa 
aacggccctc 
aaacattcca 
atetetaage 
ctccaccttc 
tgtataagat 
agggccagct 
ggaaccttga 
agcgtagccc 
caaaaaatca 
ataaaggcag 
gtctgcgggt 
gcctgtctta 
gcgtgaccgt 
atgteeggag 
taaaaagcga 
agcccccata 
accctcctgc 
gcggcagcct 
cgacacggca 
atataggact 
cgcgaaccta 
ccgttttccc 
agttactccg 
aaactccacc 

tg 



tccctcccaa 
atcatgggta 
acgctcatca 
cagctgctga 
agtccacgcc 
cagcagcgcg 
agtggtctcc 
acagcagcgc 
attgttcaaa 
acccacgtgg 
gctggacata 
aaacctctga 
cccgccggct 
ctcgtaacca 
gtgcatacac 
aacccattcc 
gttgtgcatt 
gcgggtttct 
ccgagatcgt 
tgaagcaaaa 
atcgctctgt 
cttcgggttc 
aataagccac 
gaagagctgg 
tgaagateta 
gaacagataa 
acgtccaagt 
gcaccttcaa 
aaatcccgaa 
agcctcaagc 
tcaaaagcgg 
gaacataatc 
caaaagaacc 
egatgtaage 
ggcaaagcct 
gtaagctccg 
ttctgcataa 
caacaggaaa 
aaaaaaactg 
tcataatgta 
ccgaaatagc 
ggaggtataa 
ctaggcaaaa 
aacagtcagc 
ccagctcaat 
aaaaaatgac 
cgcccagaaa 
acgttacgta 
ccctaaaacc 
ccctcattat 



cacacagagt 
acagacatat 
gtgatattaa 
gccacaggct 
tacatggggg 
cgaataaact 
teagegatga 
accctgatct 
atcccacagt 
ccatcatacc 
aacattacct 
ttaaacatgg 
atacactgca 
tggatcatca 
ttcctcagga 
tgaatcagcg 
gtcaaagtgt 
gtctcaaaag 
gttggtcgta 
ccaggtgcgg 
gtagtagttg 
tatgtaaact 
acccagccaa 
aagaaccatg 
ttaagtgaac 
tggcatttgt 
ggacgtaaag 
ccatgcccaa 
tattaagtcc 
agcgaatcat 
aacattaaca 
gtgcaggtct 
cacactgatt 
tttgttgcat 
cgcgcaaaaa 
gaaccaccac 
acacaaaata 
aacaaccctt 
gtcaccgtga 
agactcggta 
ccgggggaat 
caaaattaat 
tagcaccctc 
cttaccagta 
cagtcacagt 
gtaacggtta 
cgaaagccaa 
acttcccatt 
tacgtcaccc 
catattgget 



acacagtcct 

tcttaggtgt 

taaactcccc 

gctgtccaac 

tagagteata 

gctgccgccg 

ttcgcaccgc 

cacttaaatc 

gcaaggcgct 

acaagcgcag 

cttttggcat 

cgccatccac 

gggaaccggg 

tgetegteat 

ttacaagctc 

taaatcccac 

tacattcggg 

gaggtagacg 

gtgtcatgcc 

gcgtgacaaa 

tagtatatcc 

ccttcatgcg 

cctacacatt 

tttttttttt 

gcgctcccct 

aagatgttgc 

gctaaaccct 

ataattetea 

ggccattgta 

gattgcaaaa 

aaaataccgc 

gcacggacca 

atgacacgca 

gggeggegat 

agaaagcaca 

agaaaaagac 

aaataacaaa 

ataageataa 

ttaaaaagca 

aacacatcag 

acatacccgc 

aggagagaaa 

ccgctccaga 

aaaaagaaaa 

gtaaaaaagg 

aagtccacaa 

aaaacccaca 

ttaagaaaac 

gccccgttcc 

tcaatccaaa 



30660 

30720 

30780 

30840 

30900 

30960 

31020 

31080 

31140 

31200 

31260 

31320 

31380 

31440 

31500 

31560 

31620 

31680 

31740 

31800 

31860 

31920 

31980 

32040 

32100 

32160 

32220 

32280 

32340 

32400 

32460 

32520 

32580 

32640 

32700 

32760 

32820 

32880 

32940 

33000 

33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33622 



<210> 45 
<211> 1746 
<212> DNA 

<213 > Artificial Sequence 



< 220 > 

<223> 5F KOI 
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Ala Ser 

Lys Glu 

Ala Pro 
210 
Gly Pro 
225 

Gly Ala 

Gly Gly 

Ser Tyr 

Gly Pro 
290 
Lys Gly 
305 

Val Asn 

Ala lie 

Asn Thr 

Ser Asn 
370 
Ser Thr 
385 

Leu Trp 

Lys Asp 

Leu Ala 

Ser Gly 
450 
Gly Val 
465 

Arg Asn 

Phe Met 

Lys Ser 

Pro Val 
530 
Thr Thr 
545 

His Asn 
Tyr lie 



Pro Pro 
180 
Pro lie 
195 

Leu His 

Gly Val 

Leu Gly 

Leu Arg 
260 
Pro Phe 
275 

Leu Phe 

Leu Tyr 

Leu Ser 

Asn Ala 
340 
Asn Pro 
355 

Lys Ala 

Gly Ala 

Thr Thr 

Ala Lys 
420 
Thr Val 
435 

Thr Val 



165 

Leu Thr 

Tyr Thr 

Val Thr 

Thr lie 
230 
Phe Asp 
245 

lie Asp 

Asp Ala 

lie Asn 

Leu Phe 
310 
Thr Ala 
325 

Gly Asp 

Leu Lys 

Met Val 

He Thr 
390 
Pro Ala 
405 

Leu Thr 
Ser Val 
Gin Ser 



Thr Ala 

Gin Asn 
200 
Asp Asp 
215 

Asn Asn 

Ser Gin 

Ser Gin 

Gin Asn 
280 
Ser Ala 
295 

Thr Ala 

Lys Gly ' 

Gly Leu 

Thr Lys 
360 
Pro Lys 
375 

Val Gly 
Pro Glu 
Leu Val 



Leu Leu 

Gly Asp 

Pro Asn 
500 
Asn lie 
515 

Thr Leu 

Pro Ser 

Tyr lie 

Ala Gin 
580 



Asn Asn 
470 
Leu Thr 
485 

Leu Ser 

Val Ser 

Thr lie 

Ala Tyr 
550 
Asn Glu 
565 
Glu 



Leu Ala 
440 
Ala His 
455 

Ser Phe 

Glu Gly 

Ala Tyr 

Gin Val 
520 
Thr Leu 
535 

Ser Met 
lie Phe 



170 
Thr Gly 
185 

Gly Lys 

Leu Asn 

Thr Ser 

Gly Asn 
250 
Asn Arg 
265 

Gin Leu 

His Asn 

Ser Asn 

Leu Met 
330 
Glu Phe 
345 

lie Gly 

Leu Gly 

Asn Lys 

Ala Asn 
410 
Leu Thr 
425 

Val Lys 



Ser Leu 

Leu Gly 

Thr Leu 
220 
Leu Gin 
235 

Met Gin 

Arg Leu 

Asn Leu 

Leu Asp 
300 
Asn Ser 
315 

Phe Asp 
Gly Ser 
His Gly 



Leu lie 

Leu Asp 

Thr Ala 
490 
Pro Lys 
505 

Tyr Leu 

Asn Gly 

Ser Phe 

Ala Thr 
570 



Thr Gly 
380 
Asn Asn 
395 

Cys Arg 

Lys Cys 

Gly Ser 

lie Arg 
460 
Pro Glu 
475 

Tyr Thr 

Ser His 

Asn Gly 

Thr Gin 
540 
Ser Trp 
555 

Ser Ser 



175 

Gly lie Asp Leu 
190 

Leu Lys Tyr Gly 
205 

Thr Val Ala Thr 

Thr Lys Val Thr 
240 

Leu Asn Val Ala 
255 

lie Leu Asp Val 
270 

Arg Leu Gly Gin 
285 

lie Asn Tyr Asn 

Lys Lys Leu Glu 
320 

Ala Thr Ala lie 
335 

Pro Asn Ala Pro 
350 

Leu Glu Phe Asp 
365 

Leu Ser Phe Asp 



Asp Lys Leu Thr 
400 

Leu Asn Ala Glu 
415 

Gly Ser Gin lie 
430 

Leu Ala Pro lie 
445 

Phe Asp Glu Asn 

Tyr Trp Asn Phe 
480 

Asn Ala Val Gly 
495 

Gly Lys Thr Ala 
510 

Asp Lys Thr Lys 
525 

Glu Thr Gly Asp 

Asp Trp Ser Gly 
560 

Tyr Thr Phe Ser 
575 
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Ala Thr Gin Gly Pro Leu Thr Val 
145 150 

Thr Ser Gly Pro Leu Thr Thr Thr 
165 

Ala Ser Pro Pro Leu Thr Thr Ala 
180 

Lys Glu Pro lie Tyr Thr Gin Asn 
195 200 
Ala Pro Leu His Val Thr Asp Asp 
210 215 

Gly Pro Gly Val Thr lie Asn Asn 
225 230 

Gly Ala Leu Gly Phe Asp Ser Gin 
245 

Gly Gly Leu Arg He Asp Ser Gin 
260 

Ser Tyr Pro Phe Asp Ala Gin Asn 
275 280 

Gly Pro Leu Phe lie Asn Ser Ala 

290 295 ' 

Lys Gly Leu Tyr Leu Phe Thr Ala 

305 210 

Val Asn Leu Ser Thr Ala Lys Gly 
325 

Ala lie Asn Ala Gly Asp Gly Leu 
340 

Asn Thr Asn Pro Leu Lys Thr Lys 
355 360 

Ser Asn Lys Ala Met Val Pro Lys 

370 375 

Ser Thr Gly Ala lie Thr Val Gly 

385 390 

Leu Trp Thr Thr Pro Ala Pro Glu 
405 

Lys Asp Ala Lys Leu Thr Leu Val 
420 

Leu Ala Thr Val Ser Val Leu Ala 
435 440 

Ser Gly Thr Val Gin Ser Ala His 

450 455 . 

Gly Val Leu Leu Asn Asn Ser Phe 

465 470 

Arg Asn Gly Asp Leu Thr Glu Gly 
485 

Phe Met Pro Asn Leu Ser Ala Tyr 
500 

Lys Ser Asn lie Val Ser Gin Val 
515 520 

Pro Val Thr Leu Thr lie Thr Leu 

530 535 

His Cys Asp Cys Arg Gly Asp Cys 

545 550 

Ser Met Ser Phe Ser Trp Asp Trp 
565 

lie Phe Ala Thr Ser Ser Tyr Thr 
580 



Ser Glu Gly Lys Leu Ala Leu Gin 
155 160 

Asp Ser Ser Thr Leu Thr lie Thr 
170 175 

Thr Gly Ser Leu Gly lie Asp Leu 
185 IPO 

Gly Lys Leu Gly Leu Lys Tyr Gly 
205 

Leu Asn Thr Leu Thr Val Ala Thr 
220 

Thr Ser Leu Gin Thr Lys Val Thr 
235 240 

Gly Asn Met Gin Leu Asn Val Ala 
250 255 

Asn Arg Arg Leu Xle Leu Asp Val 
265 270 

Gin Leu Asn Leu Arg Leu Gly Gin 
285 

His Asn Leu Asp Xle Asn Tyr Asn 
300 

Ser Asn Asn Ser Lys Lys Leu Glu 
315 320 

Leu Met Phe Asp Ala Thr Ala lie 
330 335 

Glu Phe Gly Ser Pro Asn Ala Pro 
345 350 

lie Gly His Gly Leu Glu Phe Asp 
365 

Leu Gly Thr Gly Leu Ser Phe Asp 
380 

Asn Lys Asn Asn Asp Lys Leu Thr 
395 400 

Ala Asn Cys Arg Leu Asn Ala Glu 
410 415 

Leu Thr Lys Cys Gly Ser Gin lie 
425 430 

Val Lys Gly Ser Leu Ala Pro lie 
445 

Leu lie lie Arg Phe Asp Glu Asn 
460 

Leu Asp Pro Glu Tyr Trp Asn Phe 
475 480 

Thr Ala Tyr Thr Asn Ala Val Gly 
490 495 

Pro Lys Ser His Gly Lys Thr Ala 
505 510 

Tyr Leu Asn Gly Asp Lys Thr Lys 
525 

Asn Gly Thr Gin Glu Thr Gly Asp 
540 

Phe Cys Thr Thr Pro Ser Ala Tyr 
555 560 

Ser Gly His Asn Tyr lie Asn Glu 
570 575 

Phe Ser Tyr lie Ala Gin Glu 
585 590 
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Met Gin 
130 
Ala Thr 
145 

Thr Ser 

Ala Ser 

Lys Glu 

Ala Pro 
210 
Gly Pro 
225 

Gly Ala 

Gly Gly 

Ser Tyr 

Gly Pro 
290 
Lys Gly 
305 

Val Asn 

Ala- lie 

Asn Thr 

Ser Asn 
370 
Ser Thr 
385 

Leu Trp 



115 

Ser Gin Ala 

Gin Gly Pro 

Gly Pro Leu 
165 

Pro Pro Leu 
180 

Pro lie Tyr 
195 

Leu His Val 

Gly Val Thr 

Leu Gly Phe 
245 

Leu Arg He 
260 

Pro Phe Asp 
275 

Leu Phe lie 

Leu Tyr Leu 

Leu Ser Thr 
325 

Asn Ala Gly 
340 

Asn Pro Leu 
355 

Lys Ala Met 
Gly Ala lie 



Gly Asp 

Leu Ala 

Ser Gly 
450 
Gly Val 
465 

Arg Asn 

Phe Met 

Lys Ser 

Pro Val 
530 
Thr Thr 
545 

His Asn 
Tyr lie 



Thr Thr Pro 
405 

Ala Lys Leu 
420 

Thr Val Ser 
435 

Thr Val Gin 

Leu Leu Asn 

Gly Asp Leu 
485 

Pro Asn Leu 
500 

Asn lie Val 
515 

Thr Leu Thr 

Pro Ser Ala 

Tyr lie Asn 
565 

Ala Gin Glu 
580 



Pro Leu 
135 
Leu Thr 
150 

Thr Thr 

Thr Thr 

Thr Gin 

Thr Asp 
215 
lie Asn 
230 

Asp Ser 

Asp Ser 

Ala Gin 

Asn Ser 
295 
Phe Thr 
310 

Ala Lys 

Asp Gly 

Lys Thr 

Val Pro 
375 
Thr Val 
390 

Ala Pro 



120 

Thr Val 

Val Ser 

Thr Asp 

Ala Thr 
185 
Asn Gly 
200 

Asp Leu 

Asn Thr 

Gin Gly 

Gin Asn 
265 
Asn Gin 
280 

Ala His 

Ala Ser 

Gly Leu 

Leu Glu 
345 
Lys He 
360 

Lys Leu 
Gly Asn 
Ser Pro 



His Asp 



Glu Gly 
155 
Ser Ser 
170 

Gly Ser 

Lys Leu 

Asn Thr 

Ser Leu 
235 
Asn Met 
250 

Arg Arg 

Leu Asn 

Asn Leu 

Asn Asn 
315 
Met Phe 
330 

Phe Gly 
Gly His' 
Gly Thr 



Ser 

140 

Lys 

Thr 

Leu 

Gly 

Leu 

220 

Gin 

Gin 

Leu 

Leu 

Asp 

300 

Ser 

Asp 

Ser 

Gly 



Thr Leu 

Val Leu 

Ser Ala 
455 
Asn Ser 
470 

Thr Glu 

Ser Ala 

Ser Gin 

lie Thr 
535 
Tyr Ser 
550 

Glu lie 



Val Leu 
425 
Ala Val 
440 

His Leu 

Phe Leu 

Gly Thr 

Tyr Pro 
505 
Val Tyr 
520 

Leu Asn 
Met Ser 
Phe Ala 



Lys Asn 
3 95 
Asn Cys 
410 

Thr Lys 

Lys Gly 

lie lie 

Asp Pro 
475 
Ala Tyr 
490 

Lys Ser 

Leu Asn 

Gly Thr 

Phe Ser 
555 
Thr Ser 
570 



Gly 

380 

Asn 



125 

Lys Leu 

Leu Ala 

Leu Thr 

Gly lie 
190 
Leu Lys 
205 

Thr Val 

Thr Lys 

Leu Asn 

lie Leu 
270 
Arg Leu 
285 

lie Asn 

Lys Lys 

Ala Thr 

Pro Asn 
350 
Leu Glu 
365 

Leu Ser 



Ser lie 

Leu Gin 
160 
lie Thr 
175 

Asp Leu 

Tyr Gly 

Ala Thr 

Val Thr 
240 
Val Ala 
255 

Asp Val. 

Gly Gin 

Tyr Asn 

Leu Glu 
320 
Ala lie 
335 

Ala Pro 
Phe Asp 
Phe Asp 



Ser 

Cys 

Ser 

Arg 

460 

Glu 

Thr 

His 

Gly 

Gin 

540 

Trp 

Ser 



Asp Lys 

Leu Asn 

Gly Ser 
430 
Leu Ala 
445 

Phe Asp 

Tyr Trp 

Asn Ala 

Gly Lys 
510 
Asp Lys 
525 

Glu Thr 
Asp Trp 
Tyr Thr 



Leu Thr 
400 
Gly Gly 
415 

Gin lie 

Pro lie 

Glu Asn 

Asn Phe 
480 
Val Gly 
495 

Thr Ala 

Thr Lys 

Gly Asp 

Ser Gly 
560 
Phe Ser 
575 
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Ile Ser Ala Pro 
Ala Ala 
Ala Pro 



Pro Leu 
150 
Leu Thr 
165 

Leu Thx 



Thr lie 
230 
Phe Asp 
245 

lie Asp 



He Asn Leu Glu 
100 

Thr Val Ala Ala 
115 

Met Gin Ser Gin 
13 0 

Ala Thr Gin Gly 
145 

Thr Ser Gly Pro 

Ala Ser Pro Pro 
180 

Lys Glu Pro lie Tyr Thr 
195 

Ala Pro Leu His Val Thr 
210 

Gly Pro Gly Val 
225 

Gly Ala Leu Gly 

Gly Gly Leu Arg 
260 

Ser Tyr Pro Phe Asp Ala 
275 

Gly Pro Leu Phe lie Asn 
290 

Lys Gly Leu Tyr 
305 

Val Asn Leu Ser 

Ala lie Asn Ala 
340 

Asn Thr Asn Pro 
355 

Ser Asn Lys Ala 
370 

Ser Thr Gly Ala 
385 

Leu Trp Thr Thr 

Lys Asp Ala Lys 
420 

Leu Ala Thr Val Ser Val 
435 

Ser Gly Thr Val Gin Ser 
450 

Gly Val Leu Leu Asn Asn 
465 470 

Arg Asn Gly Asp Leu Thr 
485 

Phe Met Pro Asn Leu Ser 
500 

Lys Ser Asn lie Val Ser 
515 

Pro Val Thr Leu Thr lie 
530 

Thr Thr Pro Ser 
545 

His Asn Tyr He 



Pro Leu 
120 
Leu Thr 
135 

Thr Val 



Leu Phe 
310 
Thr Ala 
325 

Gly Asp 

Leu Lys 

Met Val 

lie Thr 
390 
Pro Ala 
405 

Leu Thr 



Thr Thr 

Thr Ala 

Gin Asn 
200 
Asp Asp 
215 

Asn Asn 

Ser Gin 

Ser Gin 

Gin Asn 
280 
Ser Ala 
295 

Thr Ala 

Lys Gly 

Gly Leu 

Thr Lys 
360 
Pro Lys 
375 

Val Gly 
Pro Ser 
Leu Val 



Leu Ala 
440 
Ala His 
455 

Ser Phe 

Glu Gly 

Ala Tyr 

Gin Val 
520 
Thr Leu 
535 

Ser Met 



Ala Tyr 
550 

Asn Glu lie Phe 
565 

Tyr lie Ala Gin Glu 



Leu Thr Val Thr Ser Glu Ala Leu 
105 HO 

Met Val Ala Gly Asn Thr Leu Thr 
125 

Val His Asp Ser Lys Leu Ser lie 
140 

Ser Glu Gly Lys Leu Ala Leu Gin 
155 l 60 

Asp Ser Ser Thr Leu Thr lie Thr 

170 175 

Thr Gly Ser Leu Gly lie Asp Leu 

185 19° 

Gly Lys Leu Gly Leu Lys Tyr Gly 
205 

Leu Asn Thr Leu Thr Val Ala Thr 
220 

Thr Ser Leu Gin Thr Lys Val Thr 
235 240 

Gly Asn Met Gin Leu Asn Val Ala 

• 250 255 

Asn Arg Arg Leu lie Leu Asp Val 

265 270 

Gin Leu Asn Leu Arg Leu Gly Gin 
285 

His Asn Leu Asp lie Asn Tyr Asn 
300 

Ser Asn Asn Ser Lys Lys Leu Glu 
315 220 

Leu Met Phe Asp Ala Thr Ala lie 
330 335 

Glu Phe Gly Ser Pro Asn Ala Pro 
345 350 

He Gly His Gly Leu Glu Phe Asp 
365 

Leu Gly Thr Gly Leu Ser Phe Asp 
380 

Asn Lys Asn Asn Asp Lys Leu Thr 
395 400 

Pro Asn Cys Arg Leu Asn Ala Glu 
410 415 

Leu Thr Lys Cys Gly Ser Gin He 
425 430 

Val Lys Gly Ser Leu Ala Pro lie 
445 

Leu lie lie Arg Phe Asp Glu Asn 
460 

Leu Asp Pro Glu Tyr Trp Asn Phe 
475 480 

Thr Ala Tyr Thr Asn Ala Val Gly 
490 495 

Pro Lys Ser His Gly Lys Thr Ala 
505 510 

Tyr Leu Asn Gly Asp Lys Thr Lys 
525 

Asn Gly Thr Gin Glu Thr Gly Asp 
540 

Ser Phe Ser Trp Asp Trp Ser Gly 
555 560 

Ala Thr Ser Ser Tyr Thr Phe Ser 
570 575 
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70 75 

Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn 
85 90 

He Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
100 105 J-J-u 

Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 
1 135 140 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 

150 155 -Lou 

Tte Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 
165 170 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 
180 185 190 

Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 Z\Jz> ^ _ 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

oTft 215 220' 

Gly Pro Gly Val Thr lie Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 

230 235 _ 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
1 245 250 

Gly Gly Leu Arg lie Asp Ser Gin Asn Arg Arg Leu lie Leu Asp Val 
260 265 2/U- 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Gly Pro Leu Phe lie Asn Ser Ala His Asn Leu Asp lie Asn Tyr Asn 

290 295 30° _ n 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 

Val Asn Leu Ser Thr 111 Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 330 

Ala lie Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 345 55U 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 
355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala lie Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 

Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu 

405 410 

Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin lie 

^ ^ q ^ 2 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro lie 

435 440 445 

Ser Glv Thr Val Gin Ser Ala His Leu lie lie Arg Phe Asp Glu Asn 

450 455 460 , 

Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 

4frc 470 475 

Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

500 505 510 

Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

* 515 520 525 

Pro Val Thr Leu Thr lie Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 

530 555 540 

His Cys Asp Cys Arg Gly Asp Cys Phe Cys Thr Thr Pro Ser Ala Tyr 

545 550 555 
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Leu Arg 
50 

Lys Met 
65 

Gin Asn 

lie Asn 

Thr Val 

Met Gin 
130 
Ala Thr 
145 

Thr Ser 

Ala Ser 

Lys Glu 

Ala Pro 
210 
Gly Pro 
225 

Gly Ala 

Gly Gly 

Ser Tyr 

Gly Pro 
290 
Lys Gly 
305 

Val Asn 

Ala lie 

Asn Thr 

Ser Asn 
370 
Ser Thr 
385 

Leu Trp 

Lys Asp 

Leu Ala 

Ser Gly 
450 
Gly Val 
465 

Arg Asn 
Phe Met 
Lys Ser 
Pro Val 



Leu Ser Glu Pro 
Gly Asn 
Val Thr 



Leu Glu 
100 
Ala Ala 
115 

Ser Gin 

Gin Gly 

Gly Pro 

Pro Pro 
180 
Pro lie 
195 

Leu His 



Gly Leu 
70 

Thr Val 
85 

lie Ser 



Gly Val 

Leu Gly 

Leu Arg 
260 
Pro Phe 
275 

Leu Phe 

Leu Tyr 

Leu Ser 

Asn Ala 
340 
Asn Pro 
355 

Lys Ala 



Ala Ala 

Ala Pro 

Pro Leu 
150 
Leu Thr 
165 

Leu Thr 

Tyr Thr 

Val Thr 

Thr lie 
230 
Phe Asp 
245 

lie Asp 

Asp Ala 

lie Asn 

Leu Phe 
310 
Thr Ala 
325 

Gly Asp 
Leu Lys 
Met' Val 



Leu Val 
55 

Ser Leu 

Ser Pro 

Ala Pro 

Pro Leu 
120 
Leu Thr 
135 

Thr Val 

Thr Thr 

Thr Ala 

Gin Asn 
200 
Asp Asp 
215 

Asn Asn 

Ser Gin 

Ser Gin 

Gin Asn 
280 
Ser Ala 
295 

Thr Ala 



Thr Ser Asn 

Asp Glu Ala 
75 

Pro Leu Gly 
90 

Leu Thr Val 
105 

Met Val Ala 

Val His Asp 

Ser Glu Gly 
.155 

Asp Ser Ser 
170 

Thr Gly Ser 
185 

Gly Lys Leu 

Leu Asn Thr 

Thr Ser Leu 
235 

Gly Asn Met 
250 

Asn Arg Arg 
265 

Gin Leu Asn 
His Asn Leu 



Gly Ala 

Thr Thr 

Ala Lys 
420 
Thr Val 
435 

Thr Val 

Leu Leu 

Gly Asp 

Pro Asn 
500 
Asn lie 
515 

Thr Leu 



lie Thr 
390 
Pro Ala 
405 

Leu Thr 

Ser Val 

Gin Ser 

Asn Asn 
470 
Leu Thr 
485 

Leu Ser 
Val Ser 
Thr lie 



Lys Gly 

Gly Leu 

Thr Lys 
360 
Pro Lys 
375 

Val Gly 

Pro Glu 

Leu Val 

Leu Ala 
440 
Ala His 
455 

Ser Phe 

Glu Gly 

Ala Tyr 

Gin Val 
520 
Thr Leu 



Ser Asn Asn 
315 

Leu Met Phe 
330 

Glu Phe Gly 
345 

lie Gly His 

Leu Gly Thr 

Asn Lys Asn 
395 

Ala Asn Cys 
410 

Leu Thr Lys 
425 

Val Lys Gly 

Leu lie lie 

Leu Asp Pro 
475 

Thr Ala Tyr 
490 

Pro Lys Ser 
505 

Tyr Leu Asn 
Asn Gly Thr 



Gly Met 
60 

Gly Asn 

Ala Gly 

Thr Ser 

Gly Asn 
125 
Ser Lys 
140 

Lys Leu 

Thr Leu 

Leu Gly 

Gly Leu 
205 
Leu Thr 
220 

Gin Thr 

Gin Leu 

Leu lie 

Leu Arg 
285 
Asp lie 
300 

Ser Lys 



Leu Ala Leu 

Leu Thr Ser 
80 

Ala Ser Asn 
95 

Glu Ala Leu 
110 

Thr Leu Thr 

Leu Ser lie 

Ala Leu Gin 
160 

Thr lie Thr 
175 

lie Asp Leu 
190 

Lys Tyr Gly 

Val Ala Thr 

Lys Val Thr 
240 

Asn Val Ala 
255 

Leu Asp Val 
270 

Leu Gly Gin 
Asn Tyr Asn 



Asp Ala 

Ser Pro 

Gly Leu 
365 
Gly Leu 
380 

Asn Asp 
Arg Leu 
Cys Gly 



Lys Leu Glu 
320 

Thr Ala lie 
335 

Asn Ala Pro 
350 

Glu Phe Asp 

Ser Phe Asp 

Lys Leu Thr 
400 

Asn Ala Glu 
415 

Ser Gin lie 
430 

Ala Pro lie 



Ser Leu 
445 

Arg Phe Asp Glu Asn 
460 

Glu Tyr 



Thr Asn 

His Gly 

Gly Asp 
525 
Gin Glu 



Trp Asn Phe 
480 

Ala Val Gly 
495 

Lys Thr Ala 
510 

Lys Thr Lys 
Thr Gly Asp 
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Phe Val Ser Pro Asn Gly Phe Gin 
35 40 

Leu Arg Leu Ser Glu Pro Leu Val 
50 55 

Lys Met Gly Asn Gly Leu Ser Leu 
65 70 

Gin Asn Val Thr Thr Val Ser Pro 
85 

He Asn Leu Glu lie Ser Ala Pro 
100 

Thr Val Ala Ala Ala Ala Pro Leu 
115 120 

Met Gin Ser Gin Ala Pro Leu Thr 
130 135 

Ala Thr Gin Gly Pro Leu Thr Val 
145 15° 

Thr Ser Gly Pro Leu Thr Thr Thr 
165 

Ala Ser Pro Pro Leu Thr Thr Ala 
180 

Lys Glu Pro lie Tyr Thr Gin Asn 
195 200 

Ala Pro Leu His Val Thr Asp Asp 
210 215 

Gly Pro Gly Val Thr lie Asn Asn 
225 230 

Gly Ala Leu Gly Phe Asp Ser Gin 
245 

Gly Gly Leu Arg lie Asp Ser Gin 
260 

Ser Tyr Pro Phe Asp Ala Gin Asn 
275 280 

Gly pro Leu Phe lie Asn Ser Ala 
290 295 

Lys Gly Leu Tyr Leu Phe Thr Ala 
305 310 

Val Asn Leu Ser Thr Ala Lys Gly 
325 

Ala lie Asn Ala Gly Asp Gly Leu 
340 

Asn Thr Asn Pro Leu Lys Thr Lys 
355 360 

Ser Asn Lys Ala Met Val Pro Lys 
370 375 

Ser Thr Gly Ala lie Thr Val Gly 
385 390 

Leu Trp Thr Thr Pro Ala Pro Glu 
405 

Lys Asp Ala Lys Leu Thr Leu Val 
420 

Leu Ala Thr Val Ser Val Leu Ala 
435 440 

Ser Gly Thr Val Gin Ser Ala His 
450 455 

Gly Val Leu Leu Asn Asn Ser Phe 
465 470 

Arg Asn Gly Asp Leu Thr Glu Gly 
485 

Phe Met Pro Asn Leu Ser Ala Tyr 
500 



25 30 

Glu Ser Pro Pro Gly Val Leu Ser 
45 

Thr Ser Asn Gly Met Leu Ala Leu 
60 

Asp Glu Ala Gly Asn Leu Thr Ser 
75 80 

Pro Leu Gly Ala Gly Ala Ser Asn 
90 95 

Leu Thr Val Thr Ser Glu Ala Leu 
105 HO 

Met Val Ala Gly Asn Thr Leu Thr 
125 

Val His Asp Ser Lys Leu Ser lie 
140 

Ser Glu Gly Lys Leu Ala Leu Gin 
155 160 

Asp Ser Ser Thr Leu Thr lie Thr 
170 175 

Thr Gly Ser Leu Gly He Asp Leu 
185 190 

Gly Lys Leu Gly Leu Lys Tyr Gly 
205 

Leu Asn Thr Leu Thr Val Ala Thr 
220 

Thr Ser Leu Gin Thr Lys Val Thr 
235 240 

Gly Asn Met Gin Leu Asn Val Ala 
250 255 

Asn Arg Arg Leu lie Leu Asp Val 
265 270 

Gin Leu Asn Leu Arg Leu Gly Gin 
285 

His Asn Leu Asp lie Asn Tyr Asn 
300 

Ser Asn Asn Ser Lys Lys Leu Glu 
315 320 

Leu Met Phe Asp Ala Thr Ala lie 
330 335 

Glu Phe Gly Ser Pro Asn Ala Pro 
345 350 

lie Gly His Gly Leu Glu Phe Asp 
365 

Leu Gly Thr Gly Leu Ser Phe Asp 
380 

Asn Lys Asn Asn Asp Lys Leu Thr 
395 400 

Ala Asn Cys Arg Leu Asn Ala Glu 
410 ' 415 

Leu Thr Lys Cys Gly Ser Gin lie 
425 430 

Val Lys Gly Ser Leu Ala Pro lie 
445 

Leu lie lie Arg Phe Asp Glu Asn 
460 

Leu Asp Pro Glu Tyr Trp Asn Phe 
475 480 

Thr Ala Tyr Thr Asn Ala Val Gly 
490 495 

Pro Lys Ser His Gly Lys Thr Ala 
505 510 
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Lys Leu 

Asp Ser 
130 
Gin He 
145 

Leu Val 

Val Ser 

Gin Leu 

Ser Asp 
210 
Glu Thr 
225 

Pro Phe 

Cys Tyr 

Ser lie 

He Gin 
290 
lie Ala 
305 

Asp Asp 



100 

Gly Asn Gly 
115 

lie Asn Thr 

Val Glu Asn 

Lys Asn Gly 
165 

Asp Thr Val 
180 

Arg Leu Tyr 
195 

Leu Lys lie 

Val Ala Ser 

Asn Thr Thr 
245 

Tyr Met Thr 
260 

Met Leu Asn 
275 

Phe Glu Trp 
Thr Leu Thr 
Glu 



Leu Lys 

Leu Trp 
135 
Thr Asn 
150 

Gly Leu 

Asn Gin 

Phe Asp 

Pro Leu 
215 
Ser Lys 
23 0 

Thr Arg 

Ser Tyr 

Ser Arg 

Asn Leu 
295 
Thr Ser 
310 



105 

Phe Asn Asn Gly Asp 
120 

Thr Gly 



Thr Asn 

Val Asn 

Met Phe 
185 
Ser Ser 
200 

Lys Asn 

Ala Phe 

Asp Ser 

Asp Arg 
265 
Met lie 
280 

Asn Ala 
Pro Phe 



lie Asn Pro 
14 0 

Asp Gly Lys 
155 

Gly Tyr Val 
170 

Thr Gin Lys 

Gly Asn Leu 

Lys Ser Ser 
220 

Met Pro Ser 
235 

Glu Asn Tyr 
250 

Ser Leu Phe 

Ser Ser Asn 

Ser Glu Ser 
300 

Phe Phe Ser 
315 



110 
lie Cys 
125 

Pro Pro 

Leu Thr 

Ser Leu 

Thr Ala 
190 
Leu Thr 
205 

Thr Ala 

Thr Thr 

lie His 

Pro Leu 
270 
Val Ala 
285 

Pro Glu 
Tyr lie 



lie Lys 

Asn Cys 

Leu Val 
160 
Val Gly 
175 

Asn lie 

Glu Glu 

Thr Ser 

Ala Tyr 
240 
Gly lie 
255 

Asn lie 

Tyr Ala 

Ser Asn 

Thr Glu 
320 



<210> 61 
<211> 1002 
<212> DHA 

<213> Artificial Sequence 



< 220 > 

<223> 35F RGD 



<400> 61 

atgaccaaga 

agcacctccc 

agcccagacg 

ctacagctaa 

aacatacgtg 

aatggattag 

aacaacggtg 

ccacctaact 

ttagtaaaaa 

gtgaaccaaa 

tctggaaatc 

acagcgacca 

cccttcaaca 

actagttatg 

atttcttcca 

tttgaatgga 

tccccctttt 



gagtccggct 

aacacccctt 

gagttcttac 

aagtgggagg 

ctacagcacc 

aaactcaaaa 

acatttgtat 

gtcaaattgt 

atggagggct 

tgttcacaca 

tattaactga 

gtgaaactgt 

ccactactag 

atagaagtct 

atgtacattg 

atctaaatgc 

tcttttctta 



cagtgactcc ttcaaccctg tctaccccta tgaagatgaa 60 
tataaaccca gggtttattt ccccaaatgg cttcacacaa 120 
tttaaaatgt ttaaccccac taacaaccac aggcggatct 180 
gggacttaca gtggatgaca ctgatggtac cttacaagaa 240 
cattactaaa aataatcact ctgtagaact atccattgga 300 
caataaacta tgtgccaaat tgggaaatgg gttaaaattt 360 
aaaggatagt attaacacct tatggactgg aataaaccct 420 
ggaaaacact aatacaaatg atggcaaact tactttagta 480 
tgttaatggc tacgtgtctc tagttggtgt atcagacact 540 
aaagacagca aacatccaat taagattata ttttgactct 600 
ggaatcagac ttaaaaattc cacttaaaaa taaatcttct 660 
agccagcagc aaagccttta tgccaagtac tacagcttat 720 
ggatagtgaa aactacattc atggaatatg ttactacatg 780 
atttcccttg aacatttcta taatgctaaa cagccgtatg 840 
tgattgtcgt ggtgattgtt tttgcgcata tgccatacaa 900 
aagtgaatct ccagaaagca acatagctac gctgaccaca 960 
cattacagaa gacgacgaat aa 100 



<210> 62 
<211> 333 
<212> PRT 
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cccccaggag 

cttactttaa 

cccactgttt 

cccatagctg 

aacaaccagc 

gttaatccta 

99 a 99 aat 9 a 

atcaaccagc 

aacgttaagt 

agcaaccgaa 

aactgctcca 

gctcacgtat 

gataacgctc 

ttggaatcat 

tttatgccca 

atccaaatct 

gcttttactt 

ttttgctaca 



ttttagcact 

aacttggaag 

ctcctcccct 

tatcagctaa 

ttagtattaa 

ccggagcgct 

gagtggacgg 

taacactgcg 

tgggatcagg 

ctcgaagtgt 

tttatgaaac 

taggtactat 

tatctttaaa 

ccacctggcg 

acagtacagt 

cgcctaacat 

ttaaatggtc 

taactgaaga 



caagtacact 

caacataact 

tacaaacagt 

ctctctcaca 

cgcgggcaga 

aggttt c aat 

tgccaactta 

attagaaaac 

cctccaattt 

accatccctc 

ccaagatgca 

aacaatcaaa 

acttcccttt 

ttaccaggaa 

gtatccacga 

caccttcagt 

agccgaaccg 

ataa 



gaccccatta 

ttagaaaatg 

aacaactccc 

ttggccaccg 

ggtttagtta 

aacacaggag 

attcttcatg 

gggttagaag 

gacagtaacg 

actaccattt 

aacctatttc 

ggtcttaaag 

gacaatcagg 

accaacgcag 

aacaaaaccg 

gtcgtctaca 

ggaaaacctt 



ctaccaatgc 

ggttactttc 

tgggtttagc 

ccgcaccact 

taactaacaa 

ctttacaatt 

tagcatatcc 

taaccagcgg 

gacgcattgc 

ggtctatctc 

tttgtctaac 

gagcactgcg 

gaaatttact 

tggcctctaa 

ctcacccggg 

acgagataaa 

ttcacccacc 



taagcatgag 

ggccacagtt 

cacatccgct 

gacagtaagc 

tgccttaaca 

aaatgctgca 

ctttgaagca 

aggaaagctt 

tattagtaat 

gcctacgcct 

taaaaacgga 

ggaaatgcac 

taactgtgcc 

tgccttaaca 

caacatgctc 

cagtgggtat 

taccgctgta 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1164 



<210> 64 
<211> 387 
<212 > PRT 

<213> Artificial Sequence 
< 220 > 

<223> 41sF 



<400> 64 
Met Lys Arg 
1 

Asp Thr Phe 

Ser Ser Asp 
35 

Tyr Thr Asp 
50 

Leu Gly Ser 
65 

Pro Thr Val 

Ala Thr Ser 

Thr Ala Ala 
115 

Gly Arg Gly 
130 

Gly Ala Leu 
145 

Gly Gly Met 

Pro Phe Glu 

Glu Val Thr 
195 

Gin Phe Asp 
210 

Arg Ser Val 
225 

Asn Cys Ser 



Thr Arg 
5 

Ser Thr 
20 

Gly Leu 



lie Glu Asp Asp 
Pro Ser 
Gin Glu 



Pro lie 

Asn lie 

Ser Pro 
85 

Ala Pro 
100 

Pro Leu 

Leu Val 

Gly Phe 

Arg Val 
165 
Ala He 
180 

Ser Gly 

Ser Asn 

Pro Ser 

lie Tyr 
245 
Gly Ala 



Thr Thr 
55 

Thr Leu 
70 

Pro Leu 

lie Ala 

Thr Val 

lie Thr 
135 
Asn Asn 
150 

Asp Gly 
Asn Gin 
Gly Lys 



lie Pro 
25 

Lys Pro 
40 

Asn Ala 



Gly Arg 
215 
Leu Thr 
230 

Glu Thr 



Glu Asn 

Thr Asn 

Val Ser 
105 
Ser Asn 
120 

Asn Asn 

Thr Gly 

Ala Asn 

Leu Thr 
185 
Leu Asn 
200 

lie Ala 



Phe Asn 
10 

Tyr Val 

Pro Gly 

Lys His 

Gly Leu 
75 

Ser Asn 
90 

Ala Asn 

Asn Gin 

Ala Leu 

Ala Leu 
155 
Leu lie 
170 

Leu Arg 
Val Lys 
lie Ser 



Thr lie 
Gin Asp 
His Val Leu Gly 



Trp Ser 
235 
Ala Asn 
250 

Thr lie 



Pro Val 

Ala Pro 

Val Leu 
45 

Glu Leu 
60 

Leu Ser 

Asn Ser 

Ser Leu 

Leu Ser 
125 
Thr Val 
140 

Gin Leu 

Leu His 

Leu Glu 

Leu Gly 
205 
Asn Ser 
220 

lie Ser 



Tyr Pro Tyr 
15 

Pro Phe Val 
30 

Ala Leu Lys 

Thr Leu Lys 

Ala Thr Val 
80 

Leu Gly Leu 
95 

Thr Leu Ala 
110 

lie Asn Ala 

Asn Pro Thr 

Asn Ala Ala 
160 

Val Ala Tyr 
175 

Asn Gly Leu 
190 

Ser Gly Leu 
Asn Arg Thr 



Leu Phe 
Thr lie 



Pro Thr Pro 
240 

Leu Cys Leu 
255 

Lys Gly Leu 



Thr Lys Asn 
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Ser Ser Asp Gly 
35 

Tyr Thr Asp Pro 
50 

Leu Gly Ser Asn 
65 

Pro Thr Val Ser 

Ala Thr Ser Ala 
100 

Thr Ala Ala Pro 
115 

Gly Arg Gly I>eu 
13 0 

Gly Ala Leu Gly 
145 

Gly Gly Met Arg 

Pro Phe Glu Ala 
180 

Glu Val Thr Ser 
195 

Gin Phe Asp Ser 
210 

Arg Ser Val Pro 
225 

Asn Cys Ser He 

Thr Lys Asn Gly 
260 

Lys Gly Ala Leu 
275 

Pro Phe Asp Asn 
290 

Thr Trp Arg Tyr 
305 

Phe Met Pro Asn 

Gly Asn Met Leu 
340 

Tyr Asn Glu lie 
355 

Gly Tyr Ala Phe 
370 

His Pro Pro Thr 
385 



25 30 

Leu Gin Glu Lys Pro Pro Gly Val Leu Ala Leu Lys 
40 45 

lie Thr Thr Asn Ala Lys His Glu Leu Thr Leu Lys 
55 6 0 

Leu Glu Asn Gly Leu Leu Ser Ala Thr Val 
75 80 

Leu Thr Asn Ser Asn Asn Ser Leu Gly Leu 
90 95 

Ala Val Ser Ala Asn Ser Leu Thr Leu Ala 

105 110 i 

Leu T hr Val Ser Asn Asn Gin Leu Ser lie Asn Ala 
120 125 

Val lie Thr Asn Asn Ala Leu Thr Val Asn Pro Thr 
135 140 

Asn Thr Gly Ala Leu Gin Leu Asn Ala Ala 
155 160 

Gly Ala Asn Leu lie Leu His Val Ala Tyr 
170 175 

Gin Leu Thr Leu Arg Leu Glu Asn Gly Leu 
185 -*-^0 

Lys Leu Asn Val Lys Leu Gly Ser Gly Leu 
200 205 

Arg He Ala lie Ser Asn Ser Asn Arg Thr 
215 220 

Thr Thr lie Trp Ser lie Ser Pro Thr Pro 
235 240 

Thr Gin Asp Ala Asn Leu Phe Leu Cys Leu 

250 255 

Val Leu Gly Thr lie Thr lie Lys Gly Leu 

265 270 

Met His Asp Asn Ala Leu Ser Leu Lys Leu 

280 285 , 

Asn Leu Leu Asn Cys Ala Leu Glu Ser Ser 

295 300 

Thr Asn Ala Val Ala Ser Asn Ala Leu Thr 

315 . 320 

Val Tyr Pro Arg Asn Lys Thr Ala His Pro 

330 335 

lie Ser Pro Asn lie Thr Phe Ser Val Val 

345 350 

Asp Cys Arg Gly Asp Cys Phe Cys Thr Ser 

360 365 ^ 

Lys Trp Ser Ala Glu Pro Gly Lys Pro Phe 

375 380 

Ala Val Phe Cys Tyr lie Thr Glu Glu 
390 385 



lie Thr 
70 

Pro Pro 
85 

Pro lie 



Phe Asn 
150 
Val Asp 
165 

lie Asn 

Gly Gly 

Asn Gly 

Ser Leu 
230 
Tyr Glu 
245 

Ala His 

Arg Glu 

Gin Gly 

Gin Glu 
310 
Ser Thr 
325 

He Gin 
Asn Cys 
Thr Phe 



<210> 67 
<211> 1737 
<212 > DNA 

<213> Artificial Sequence 



< 220 > 

<223> Ad5 PD1 penton 



< 400> 67 tqaggaaggt cctcctccct cctacgagag tgtggtgagc 60 

atgcggcgcg cggcgag | t || gt 9| t cccttcga tg ctcccctgga cccgccgttt 120 

gtgcctccgc gftacctgcg gcctaccggg gggagaaaca gcatccgtta ctctgagttg 8 
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Lys Phe 
210 
Leu Val 
225 

lie Leu 

Asn Leu 

lie Thr 

Val Asp 
290 
Gly Gly 
305 

Ser Asn 

Ser Arg 

Thr Arg 

Ala Ala 
370 
Val lie 
385 

lie Ser 

Tyr Asn 

Cys Thr 

Pro Asp 
450 
Ser Asn 
465 

Ser Phe 

Thr Ser 

Ala Arg 

Ala Leu 
530 
Val Gin 
545 

Val Tyr 
Thr Phe 



195 

Asp Thr Arg 
Met Pro Gly 



200 

Asn Phe Arg 
215 

Val Tyr Thr 
230 

Cys Gly Val 



Leu Pro Gly 
245 

Leu Gly He Arg Lys Arg 
260 

Tyr Asp Asp 
275 

Ala Tyr Gin 



Leu Gly Phe 

Asn Glu Ala 
235 

Asp Phe Thr 
250 

Gin Pro Phe 
265 

Gly Asn lie 



Leu Glu Gly 
280 

Ala Ser Leu Lys Asp Asp 
295 

Ala Gly Gly Ser Asn Ser 
310 

Ala Ala Met 



Ala Ala Ala 
325 

Gly Tyr Pro 
340 

Ala Glu Glu 
355 

Pro Ala Ala 

Lys Pro Leu 

Asn Asp Ser 
405 

Tyr Gly Asp 
420 

Pro Asp Val 
435 

Met Met Gin 

Phe Pro Val 

Tyr Asn Asp 
485 

Leu Thr His 
500 

Pro Pro Ala 
515 

Thr Asp His 

Arg Val Thr 

Lys Ala Leu 
565 



Tyr Asp Val 

Lys Arg Ala 
360 

Gin Pro Glu 
375 

Thr Glu Asp 
390 

Thr Phe Thr 

Pro Gin Thr 

Thr Cys Gly 
440 

Asp Pro Val 
455 

Val Gly Ala 
470 

Gin Ala Val 

Val Phe Asn 

Pro Thr lie 
520 

Gly Thr Leu 
535 

lie Thr Asp 
550 

Gly lie Val 



Ser Gly Ser 
315 

Gin Pro Val 
330 

Pro Asp Tyr 
345 

Glu Ala Glu 

Val Glu Lys 

Ser Lys Lys 
395 

Gin Tyr Arg 
410 

Gly lie Arg 
425 

Ser Glu Gin 

Thr Phe Arg 

Glu Leu Leu 
475 

Tyr Ser Gin 
490 

Arg Phe Pro 
505 

Thr Thr Val 

Pro Leu Arg 

Ala Arg Arg 
555 

Ser Pro Arg 
570 



205 
Asp Pro 
220 

Phe His 

His Ser 

Gin Glu 

Pro Ala 
285 
Thr Glu 
300 

Gly Ala 

Glu Asp 

Ala Gly 

Ala Ala 
365 
Pro Gin 
380 

Arg Ser 

Ser Trp 

Ser Trp 

Val Tyr 
445 
Ser Thr 
460 

Pro Val 

Leu lie 

Glu Asn 

Ser Glu 
525 
Asn Ser 
540 . 
Arg Thr 

Val Leu 



Val Thr Gly 



Pro Asp lie 
240 

Arg Leu Ser 
255 

Gly Phe Arg 
270 

Leu Leu Asp 

Gin Gly Gly 

Glu Glu Asn 
320 

Met Asn Asp 
335 

Thr Ser Ala 
350 

Ala Glu Ala 

Lys Lys Pro 

Tyr Asn Leu 
400 

Tyr Leu Ala 
415 

Thr Leu Leu 
430 

Trp Ser Leu 

Arg Gin lie 

His Ser Lys 
480 

Arg Gin Phe 
495 

Gin lie Leu 
510 

Asn Val Pro 

lie Gly Gly 

Cys Pro Tyr 
560 

Ser Ser Arg 
575 



<210> 69 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
< 220 > 

<223> 5TS35H 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
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Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 

Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

Ala Pro III His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

Gly III Gly val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 

Gly Ala Leu Gly Phe ^p Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

Gly Gly Leu Arg He Asp Ser Gin J-n Arg Arg Leu He Leu Asp Val 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Ar| Leu Gly Gin 

Gly Pro III Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

Lys Gly Leu Tyr Leu Phe Tto Ala Ser Asn Asn Ser Lys Lys Leu Glu 

val Asn Leu Ser Thr III Lys Gly Leu Met Phe Asp Ala Thr Ala He 

Ala He Asn Ala Sy Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

Asn Thr Asn lH Leu Lys Thr g; He Gly His Gly Leu Glu Phe Asp 

ser Asn lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

ser Tte Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 

Leu Trp Thr Gly He Pro Pro Pro Asn Cys Gin He Val Glu Asn 

Thr Asn Thr Asn Jsp Gly Lys Leu Thr Leu Val Leu Val Lys Asn Gly 

Gly Leu Val Asn Gly Tyr Val Ser Leu Val Gly Val Ser Asp Thr Val 

Asn Gin Met Phe Thr Gin Lys Thr Ala Asn He Gin Leu Arg Leu Tyr 

Phe tS> Ser Ser Gly Asn Leu Leu Thr Glu Glu Ser Asp Leu Lys He 

Pro Leu Lys Asn Lys Ser Ser Thr Ala Thr Ser Glu Thr Val Ala Ser 

Ser Lys Ala Phe Met Pro Ser Thr Thr Ala Tyr Pro Phe Asn Thr Thr 

Thr Arg Asp Glu Asn Tyr lie His Gly He Cys Tyr Tyr Met Thr 

Ser Tyr Asp Arg Ser Leu Phe Pro Leu Asn He jjr He Met Leu Asn 

Ser Sg Met He Ser Ser Asn Val Ala Tyr Ala He Gin Phe Glu Trp 

Leu Asn Ala Ser Glu Ser Pro Glu Ser Asn He Ala Thr Leu Thr 

Thr Ser Pro Phe Phe Phe Ser Tyr lie Thr Glu Asp Asp Glu 
580 585 



<210> 71 
<211> 945 
<212 > DNA t 

<213> Artificial 



Sequence 



< 220 > 

<223> 35TS5H 
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Gln Glu Thr Gly Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser 
275 280 285 

Tro Asp Trp Ser Gly His Asn Tyr lie Asn Glu lie Phe Ala Thr Ser 
290 295 300 

Ser Tyr Thr Phe Ser Tyr lie Ala Gin Glu 
305 310 
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